This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 




FIGURE 133 



^^^^TCGGTGATACGGCGTCTTGCCACeGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCCT^ 
GGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCTVATTTGTCAGTTT^ 
ATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCTVTGTACTVGAAGCAT^ 
TCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGCTGAACTGAGACA 
AGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCAT 
TCTGGAGTGACATGATGGACTCCGC 



FIGURE 134 

CTVCmCTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCAAGGGTCNGGAAAACAGC^ 

GAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTlXX^ 

GGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTG 

AAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCC 

TTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAAT 

TTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATG^^ 

CAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGC 

TTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTGTTTCCTCT 

AACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 




FIGURE 135 

GCGAGGTGXSCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGTGC^ 

GCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGCGTGC^ 

ACTGGCCTCCCT^GCGTGGGGCGACAAGCTGCCGGAGCTGCA ATGG GCCGCGGCTGGGGATTCTT 

GTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCC7VCGGAGAGGAGC7VGCCCCCGGAGA 

CAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTG^ 

ACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAAACTTCTTGAA^ 

CTTTAGGTATTACT^GGTAAACCTGAAGAGGCCGTGTCCTTTCTGGAATGACATCAGCCA 

GAAGAAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTGCG 

AGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGAAGAATGTGAACAAGCTGAACGACTTGG 

AGCAGTGGATGAATCTCTGAGTGAGGAAA.CACAGAAGGCTGTTCTTCAGTGGACCAAGCATGATG 

ATTCTTCAGATAACTTCTGTGAAGCTGATGACATTCAGTCCCCTGAAGCTGAATATGTAGATTTG 

CTTCTTAATCCTGAGCGCTACACTGGTTACAAGGGACCAGATGCTTGGAAAATATGGAATGTCAT 

CTACGAAGAAAACTGTTTTAAGCCACAGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTC 

AAGGGACAAGTGAAGAGAACACTTTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCA 

TTCTACAGACTTATATCTGGCCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTT 

ACAAGAGACCTGGTTAGAAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATG 

GAATTTTGACTGAAGGAGAAGGTCCAAGAAGGCTTAAGAACTTGTATTTTCTCTACTTAATAGAA 

CTAAGGGCTTTATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAA 

TAAAATTCAGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTC 

CTTTGCATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAC^^ 

GACTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTCG 

TCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAGAAAT 

TGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGAO^AGAA^ 

TCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACTTCAGGAACTT 

GTTACAGAATATTCA TTAA AGAAAACAAGCTGATATGTGCCTGTTTCTGGACAATGGAGGCGAAA 

GAGTGGAATTTCATTCAAAGGCT^TAATAGCAATGACAGTCTTAAGCCAAACATTTTATATAAAGT 

TGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTTAAAAATTGTGTTAAGTCTA 

TGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTGGTACAAATTTTAAAGOT 

TTQAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 



FIGURE 13 6 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA53974 
<subunit 1 of 1, 468 aa, 1 stop 
<MW: 54393, pi; 5.63, NX(S/T): 2 

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYRiiFPRL 
QKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEANNLIEE 
CEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLLNPERYTGYKGPD 
AWKIWNVIYEENCFKPQTIKI^PLNPIiASGQGTSEENTFYSWLEGLCVEKRAFYRLISGLHASIlW 
HLSARYLLQETWLEKKWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLIELRALSKVLPFFERP 
DFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAHKLKEDFRLHFRNISRIMD 
CVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFHLTRQEIVSLFNAFGRISTSVK 
ELENFRNLLQNIH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site. 

amino acids 94-97 

Glycosaminoglycah attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class -V pyridoxal -phosphate 

amino acids 216-222 

lnterleukin-7 proteins 

amino acids 338-343 



FIGURE 137 

GCTGGAAATATGC3ATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAAAGACCTTTAA 

ATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAAGGTCTC 

TGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACT^TGCAAGCATTAATGTGC^ 

GAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGAGT^CAAO^TTAC^ 

TTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGGCTTAAGAACTTGTAT 

TTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATTCTTNGAGCGCCCAGATTT 

TCAACTNTTTACTGGAAATAAAATTCAGGATGAGGNAAACAAAATGTTACTTT^ 

ATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCATTTTTTTGCTG 




^ FIGURE 138 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA^ 

GAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTC7\.TACTGGACA 

GGC ATGG AACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGGTCTCTGCTC 

CCCCTTTAACCTGGATGAACATCT^CCCACGCCTATTCCCAGGGCCACCAGAAGCTGAATTT^ 

ACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGCGCCCCCTGGGATGC^ 

CCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCC^ 

TGCCAAGGGCCACTTAGGTGACTACCAACTiSGGAAATTCATCTCATCCTGCTGTGA^ 

TGGGGATGTCTCTGTTAGAGACAGATGQTGATGGGGGATTCATGGTGAG CTAAG GAGAGGGTGGT 

GGCAGTGTCTCTGAAGGTCCATAAAAGAAATUUVGAGAAGTGTGGTAAGGGAAAATGGTCTGTGTG 

GAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAAAGGTAGGTAATGTCAGGGAGTAGTCTTCAT 

GCCTCCTTCAACTGGGAGCATGTTCTGAGGGTGCCCTCCCAAGCCTGGGAGTAACTATTTCCCCC 

ATCCCCAGGCCTGTGCCCCTCTCTGGTCTCGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATA 

TGTGCCCGTGTGGATGCTTCATTCCAGCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCC 

AGAGGAAGGCTGAGTACTTGGTTCCCAGAAGGAGATACTGGGTGGGAAAAAGATGGGGCAAAGCG 

GTATGATGCCTGGCAAAGGGCCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCT 

CCATGTTTCCTAACAGATTCAGACTCCTGGCCAGGTGTGGTGGCCCACACCTGTAATTCTAGCAC 

TTTGGGAGGCCAAGGTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 

TGAAACTCCATCTCTACTAAAAAAAAAAAAATACAAAAATTAGCTGGGTGCGCTAGTGCATGCCT 

GTAATCTCATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTG 

CGGTGAGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCA 

AAAATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGT7VAC 

TCATGCCTGTAATCCCAACATTTTGGGAGGCCAAGGCAGGAGGATTGCTTGAGGTCTGGAGGTTT 

GAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAAtGTTTTAAAAAT 




FIGURE 13 9 

></usr/seqdb2/sst/DNA/Dnaseqs -min/ss .DNA57039 
xsubunit 1 of 1, 1-24 aa, 1 stop 
><MW: 13352, pi: 5.99, NX(S/T): 1 

MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPWDGP 
SGDRRGDVYRCPVGGAHNAPCTVKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS 

Important features : 
Signal . peptide : 

amino acids 1-22 

Cell attachment sequence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 



Integrins alpha chain proteins 

amino acids 67-81 



FIGURE 140 

CACAGTTCCCOVCCATCACTCNTCCCT^TTCCTTCCAACTTTATTTTTAGCTTGCCATTa 

GGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACTVGAGGCCAGAG^ 

TACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTG 

TTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCC 

ACCAGAAGCTGAATTTGGATACAGTGTCTTACAACTVTGTTGGGGGTGGACAGC^ 

TGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGG 

GGGGCCCACAATGCCCCATGTGCCAAGGGCC7^CTTAGGTGACTACCAACTGGGA?^TT 

TCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGG 




FIGURE 141 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTGGGC 

AGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACAATTCAG 

GCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCTGAGATGGAC 

AGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCAAATGCAGACTTT 

CACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCTACGCATTGATTCCAT 

GTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTCTCTGTACTCTCAACCAAC 

ATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGA 

ATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCAC 

TCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTCGT 

GTCAGGGCCACATTGGGCTCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAA 

CTCAACCATCCTTACCCGACCTGGGATGGAGATCACCAAAGATGGCTTCCACCTGGTTATTGAGC 

TGGAGGACCTGGGGCCCCAGTTTGAGTTCCTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAG 

.GAACATGTC?^AAATGGTGAGGAGTGGGGGTATTCCAGTGCACCTAGAAACCATGGAGC 

TGCATACTGTGTGAAGGCCCAGACATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGA 

CAGAATGTGTGGAGGTGCAAGGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGC 

TTCATGCTGATCCTTGTGGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTC 

CTGTTGCCCCGTGGTGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCA 

GCTGCAGAAGGGAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGG 

GCCTGGATCTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGG 

AAACCATGAGGGGACAAGTTGTGTTTCTGTtTTCCGCCACGGACTiJ^Gi^^ 

GAGCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACACT 

GACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACCCTGG 

GAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACCTACACAC 

CTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAATACACCCAGC 

ACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAGTGTTTCTGGAGAG 

CAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGTGGCTTGGAGAGCCCAC 

TTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGGTGTTGAGTTCACTTCAAGC 

CCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGTAGGTGACCTGGAGGAAGGTCA 

AGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATCCATGAACTACTGTAAAGTGTTGACA 

GTGtGTGCACACTGCAGACAGCAGGTGAAATGTATGTGTGCAATGCGACGAGAATGCAGAAGTCA 

GTAACATGTGCATGTTTGTTGTGCTCCTTTTTTCTGTTGGTAAAGTACAGAATTCAGCAAATAAA 

AAGGGCCACCCTGGCCAAAAGCGGTAAAAAAAAAAAAAAAA 



FIGURE 142 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss , DNA57033 
<subunit 1 of 1, 311 aa, 1 stop 
<MW: 35076, pi: 5.04, NX(S/T): 2 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGETVY 
YSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAWSILKHP 
FNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMWSGGIPVHLETM 
EPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAI PLVLALFAFVGFML I LVWPLFVWKMGRL 
LQ YS CC P VyVLPDTLKI TNS PQKL I S CRREE VDACATAVMS PEELLRAW I S 

Important features : 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins. 

amino acids 92-119 

Integrins alpha chain proteins 

amino acids 232-262. 



FIGURE 143 

tcctgctgatgcacatctgggtttggcaaaaggaggttgcttcgagccgccctttctagcttcct 

ggccggctctagaacaattcaggcttcgctgcgactagacctcagctccaacatatgcatt 

agaaagatggctgagatgacagaatgctttattttggaaagaaacaatgttctaggtcaaactga 

gtctaccaaatgcagactttcacaatggttctagaagaaatctgg^ 

tttgtacgcattgattccatgtttgctpacagatgaagtggccattctgcctgcccctcagaacc 

tctctgtactctcaaccaacatgaagcatctcttgatgtggagcccagtgatcgcgcctggag;^ 

acagtgtactattctgtcgaataccagggggagtacgagagcctgtacacgagccacatctggat 

ccccagcagctggtgctcactcactgaaggtcctgagtgtgatgtcactgatgacatcacggcca 

ctgtgccatacaacctttgtgtcagggccacattgggctcacagacctcagcctggagcatcctg 

aagcatccctttaatagaaactcaaccatccttacccgacctgggatggagatcaccaaagatgg 

cttncacctggttattgagctggaggacctggggccccagtttgagttgcttgtggcctantgga 

ggaggggcgaaccccttgcggcgcaaggggttngcgaaccccttgcggccgctggggtatctctc 

gagaaaagagaggcccaatatgacccacatactcaatatggacgaantgctattgtccacctgtt 

tgagtggcgctgggttgat 




FIGURE 144 



CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGAGGC 

AGGAGAGGAGGAGGTGGGGAGAGC7VCGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGAGGAGGT 

GAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAGGAGGAGATG 

CGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAACGGAGAGK^ 

TGGGTTAGGAACO^GAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGQAGTAGGAAGATC^ 

GCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGO^GAGGAAAGAGQAAAGACACT^G^ 

GAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCT^GTCCCTGGCTGCTTTGGCTVTTTGGC^ 

CTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCTGGAGGGACAGGGTCCCAGAAGGAG 

GGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGGCAGGGGTCCCTCGGAGGCCTCCTGGGG 

ATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGCGCTGGTACTCTGGGCTGCACTGGGGGCAGC 

AGCTCACATCGGACCAGCACCTGACCCCGAGGACTGGTGGAGCTACAAGGATAATCTCCAGGGAA 

ACTTCGTGCCAGGGCCTCCTTTCTGGGGCCTGGTGAATGCAGCGTGGAGTCTGTGTGCTGTGGGG 

AAGCGGCAGAGCCCCGTGGATGTGGAGCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATT 

AAGGCTCAGCACTGGAGGAGAGAAGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCT 

TCCTGCCTGCAGCCCGACCTGTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTC 

AGTGAACTGCGGCTGCTGTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCA 

GGGCTTCTCTGCTGAGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTG 

CCTCCCGCGGCCCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAAC 

CCATTCCTCAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTA 

CTTTCTTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGG 

GCTCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAAT 

ATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCTTCCA 

GAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGCAACAGGG 

ACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGATGGTGTCCCC 

CATGGTCGC TGAG ACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTCCCCACAAGGCGAG 

GGGAGTTACCCCTAAAACAAAGCTATTAAAGGGACAGAATACTTA 



FIGURE 145 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA34353 
<subunit 1 of 1, 328 aa, 1 stop 
<MW: 36238, pi: 9.90, NX(S/T): 3 

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLCAVG 
KRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPLLYSHRL 
SELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGIAILSLFVNVASTSN 
PFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSETVTWILIDRALN 
ITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPERRCRGPNYRLHVDGVP 
HGR 

Important features : 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotic- type carbonic anhydrases proteins 

amino acids 222-270, 128-164 and 45-92 



FIGURE 146 

ggcgcctggttctgcgcgtactggctgtacggagcaggagcaagaggtcgccgccagcctccgccgccgagcctcgtt 
cgtgtccccgcccctcgctcctgcagctactgctcagaaacgctggggcgcccaccctggcagactaacgaagcagct 
cccttcccaccccaactgcaggtctaattttggacgctttgcctgccatttcttccaggttgagggagccgcagaggc 
ggaggctcgcgtattcctgcagtcagcacccacgtcgcccccggacgctcggtgctcaggcccttcgcgagcggggct 
ctccgtctgcggtcccttgtgaaggctctgggcggctgcagaggccggccgtccggtttggctcacctctcccaggaa 
acttcacactggagagccaaaaggagtggaagagcctgtcttggagattttcctggggaaatcctgaggtcattcatt 
atga agtgtaccgcgcgggagtggctcagagtaaccacagtgctgttcatggctagagcaattccagccatggtggtt 
cccaatgccactttattggagaaacttttggaaaaatacatggatgaggatggtgagtggtggatagccaaacaac^ 
gggt^aaagggccatcacagact^tgacatgct^gagtattttggaccttcat;^ 

acagcctctaatatggagtatatgacatgggatgtagagctggaaagatctgcagaatcctgggctgaaagttgcttc 

tgggaacatggacctgcaagcttgcttccatcaattggacagaatttgggagcacactggggaagata 

acgtttcatgtacaatcgtggtatgatgaagtga/uvgactttagctacccatatqaacatgaatgcaacccatattgt 

ccattcaggtgttctggccctgtatgtacacattatacacaggtcgtgtgggcaactagtaacagaatcggttgtgc 

attaatttgtgtcataacatgaact^tctgggggcagatatggcccaaagctgtctacctggtgtgcaat^ 

AAGGGAAACTGGTGGGGCCATGCCCCTTACAAACT^TGGGCGGCCCTGTTCTGCTTGCCC^ 

tgtagagaaaatctgtgctactu^gaagggtcagacaggtattatccccctcgagaagag^^ 

cgacagcagtcacaagtccatgacacccatgtccggacaagatcagatgatagtagcagaaatgaagtcataagcgca 

cagcaaatgtcccaaattgtttcttgtgaagtaagattaagagatcagtgcaaaggaacaacctgcaataggtacgaa 

tgtcctgctggctgtttggatagtaaagctaaagttattggcagtgtacattatgaaatgcaatccagcatctgtaga 

gctgcaattcattatggtataatagacaatgatggtggctgggtagatatcactagacaaggaagaaagcattatttc 

atcaagtccaatagaaatggtattcaaacaattggcaaatatcagtctgctaattccttcacagtctctaaagtaaca 

gttcaggctgtgacttgtgaaacaactgtggaacagctctgtccatttcataagcctgcttcacattgcccaagagta 

tactgtcctcgtaactgtatgcaagcaaatccacattatgctcgtgtaattggaactcgagtttattctgatctgtcc 

agtatctgcagagcagcagtacatgctggagtggttcgaaatcacggtggttatgttgatgtaatgcctgtggacaaa 

agaaagacctacattgcttcttttcagaatggaatcttctcagaaagtttacagaatcctccaggaggaaaggcattc 

agagtgtttgctgttgt gtga aactgaatacttggaagaggaccataaagactattccaaatgcaatatttctgaatt 

ttgtataaaactgtaacattactgtacagagtacatcaactattttcagcccaaaaaggtgccaaatgcatataaatc 

ttgataaacaaagtctataaaataaaacatgggacattagctttgggaaaagtaatgaaaatataatggttttagaaa 

tcctgtgttaaatattgctatattttcttagcagttatttctacagttaattacatagtcatgattgttctacgtttc 

atatattatatggtgctttgtatatgccactaataaaatgaatctaaacattgaatgtgaatggccctcagaaaatca 

tctagtgcatttaaaaataatcgactctaaaactgaaagaaaccttatcacattttcccctvgttc^ 

taccaactccaaataatctcaaataattttccacttaataactgtaaagtttttttctgttaatttaggcatat^ 

tattaaattctgatattgcacttcttattttatataaaataatcctttaatatccaaatgaatctgttaaaatgtttg 

attccttgggaatggccttaaaaatt^tgtaataaagtcagagtggtggtatgaaaacattcctagtgatcatgtag 

taaatgtagggttaagcatggacy^gccagagctttctatgtactgttaaaattgaggtcacatattttcttttgtatc 

ctggct^aatactcctgcaggcctvggaagtataatagcaaaaagttgaacaaagatgaactaatgtattacattac^ 




GGTGGATTTACATATTAAATTTTTTCTGCTGGTGGATAAACATTAAAATTAATCATGTTTCAATUWUUWWV^^ 



FIGURE 147 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA45417 
<subunit 1 of 1, 500 aa, 1 stop 
<MW: 56888, pi: 8.53, NX(S/T): 2 

MKCTAREWLRVTTVLFMARAI PAMWPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDNDMQS I 
LDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNLGAHWGRYRPP 
TFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLCHNMNIWGQI 
WPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRYYPPREEETNEIE 
RQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNRYECPAGCLDSKAKVI 
GSVHYEMQSSICRAAIHYGIIDNDGGWVDITRQGRKHYFIKSNRNGIQTIGKYQSANSFTVSKVT 
VQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIGTRVYSDLSSICRAAVHAGWR 
NHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRVFAW 

Important features : 
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 protein 

amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 



N- glycosylat ion s i te 

amino acids 28-31 




FIGURE 148 

GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACC(^ 

CCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGTGGCGTC 

TCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCATGTCCCTGCTCCCACGCCGCGCCCCTGC 

GGTCAGCATGAGGCTCCTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGTACACCGCGCGTG 

TGGACGGGTCCT^AATGC^GTGCTCCCGGAAGGGACCCAAGATCCGCTACAGCGACGTGA^ 

CTGGAAATGAAGCCAAAGTACCCGO^CTGCGAGGAGAAGATGGTTATCATCTVCCACC^ 

GTCCAGGTACCGAGGTCSVGGAGCACTGCCTGCACCCCAAGCTGCAGAGC^ 

AGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAAGA ATAGG GTGAAAAACCTCAGAAG 

GGAAAACTCCAAACCT^GTTGGGAGACTTGTGCAAAGGACTTTGCAGATTAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAGCCTTTCTTTCTCACAGGCATAAGACACAAA 

ATTGTTATGAAGCACTTTTTACCAACGGTCAGTTTTTACATTTTATAGCTGCGTGCGAAAGGCTT 

CCAGATGGGAGACCCATCTCTCTTGTGCTCCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGG 

GGAAAACTCATGCCTTTCCTTTTTAAAAAATGCTTTTTTGTATTTGTCCATACGTCACTATACAT 

CTGAGCTTTATAAGCGCCCGGGAGGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCT 

CCATTCCTAGCTTGGGAAGCTTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTC 

TCCTGGGCTTATGGCCGGTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGC 

AGGAGCAGGTCTCTCTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTT 

CATTCCCCCCTGGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGAC 

AAATGATCCTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTT 

CAATATTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAA 

CTTAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTGACAAGGGAAA 

ACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATATTTG 

GATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGATGACCATG 

CGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCCCAGTATATGC 

CGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAi^CCATTAGCATTGCATGCAGGI^ 

CATATTCTTTCTAAGATGGAAAGTAATAAAATATATTTGAAATGTAAAAAAAAAAAAAAA 



FIGURE 149 

MSLLPRRAPPVSMRLLAAMiLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKK^ 
KMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE 



S ignal s equenc e : 

amino acids 1-34 




FIGURE 150 

gccccagggactgctatggcttcctttgttgttcaccccggtctgcgtc atg ttaaactccaatgtcctcctgtggtt 

aactgctcttgccatcaagttcaccctcattgacagccaagcacagtatccagttgtcaacacaaattatggcaaaat 

ccggggcctaagaacaccgttacccaatgagatcttgggtccagtggagcagtacttaggggtcccctatgcctcacc 

ccccactggagagaggcggtttcagcccccagaacccccgtcctcctggactggcatccgaaatactactcagtttgc 

tgctgtgtgcccccagcacctggatgagagatccttactgcatgacatgctgcccatctggtttaccgccaatttgga 

tactttgatgacctatgttcaagatcaaaatgaagactgcctttacttaaacatctacgtgcccacggaagatggagc 

caacacaaagaaaaacgcagatgatataacgagtaatgaccgtggtgaagacgaagatattcatgatcagaacagtaa 

gaagcccgtcatggtctatatccatgggggatcttacatggagggcaccggcaacatgattgacggcagcattttggc 

aagctacggaaacgtcatcgtgatcaccattaactaccgtctgggaatactagggtttttaagtaccggtgaccaggc 

agcaaaaggcaactatgggctcctggatcagattcaagcactgcggtggattgaggagaatgtgggagcctttggc^ 

ggaccccaagagagtgaccatctttggctcgggggctggggcctcctgtgtcagcctgttgaccctgtcccactactc 

agaaggtctcttccy^gaaggccatcattcagagcggcaccgccctgtccagctgggcagtgaactacc^ 

gtacactcggatattggcagacaaggtcggctgcaacatgctggacaccacggacatggtagaatgcctgcggaaca^ 

gaactacaaggagctcatccagcagaccatcaccccggccacctaccacatagccttcgggccggt 

cgtcatcco^gacgacccccagatcctgatggagcaaggcgagttcctcaactacgacyvtcatgctgggcgtc^ 

aggggaaggcctgaagttcgtggacggcatcgtggataacgaggacggtgtgacgcccaacgactttg 

gtccaacttcgtggacaacctttacggctaccctgaagggaaagacactttgcgggagactatcaagttcatgtacac 

agactgggccgataaggaaaacccggagacgcggcggaaaaccctggtggctctctttactgaccaccagtgggtggc 

ccccgccgtggccgccgacctgcacgcgcagtacggctcccccacctacttctatgccttctatcatcactgccaaag 

cgaaatgaagcccagctgggcagattcggcccatggtgatgaggtcccctatgtcttcggcatccccatgatcggtcc 

caccgagctcttcagttgtaacttttccaagaacgacgtcatgctcagcgccgtggtcatgacctactggacgaactt 

cgccaaaactggtgatccaaatcaaccagttcctcaggataccaagttcattcacacaaaacccaaccgctttg7vaga 

agtggcctggtccaagtataatcccaaagaccagctctatctgcatattggcttgaaacccagagtgagagatcacta 

ccgggcaacgaaagtggctttctggttggaactcgttcctcatttgcacaacttgaacgagatattccagtatgtttc 

aacaaccacaaaggttcctccaccagacatgacatcatttccctatggcacccggcgatctcccgccaagatatggcc 

aaccaccaaacgcccagcaatcactcctgccaacaatcccaaacactctaaggaccctcacaaaacagggcctgagga 

cacaactgtcctcattgaaaccaaacgagattattccaccgaattaagtgtcaccattgccgtcggggcgtcgctcct 

cttcctcaacatcttagcttttgcggcgctgtactacaaaaaggacaagaggcgccatgagactcacaggcgccccag 

tccccagagaaacaecacaaatgatatcgctcacatccagaacgaagagatcatgtctctgcagatgaagcagctgga 

acacgatcacgagtgtgagtcgctgcaggcacacgacacactgaggctcacctgcccgccagactacaccctcacgct 

gcgccggtcgccagatgacatcccacttatgacgccaaacaccatcaccatgattccaaacacactgacggggatgca 

gcctttgcactvcttttaacaccttcagtggaggacaaaacagtacaaatttaccccacggacattccac^ 

atag ctttgccctatttcccttcctatccctctgccctacccgctcagcaacatagaagagggaaggaaagagagaag 

gaaagagagagagaaagaaagtctccagaccaggaatgtttttgtcccactgacttaagacaaaaatgcaaa;^ 

gtctvtcccatcccggcagacccttatcgttggtgttttccagtattacaagatcaacttctgaccctgtga^ 

gaagtacacatttctgttaaaataactgctttaagatctctaccactccaatcaatgtttagtgtgataggacatcac 

catttcaaggccccgggtgtttccaacgtcatggaagcagctgacacttctgaaactcagccaaggacac 

ttttaattacaatggaagtttaaacatttctttctgtgccacacaatggatggctctccttaagtgaag 

atgagattttgcccagcacatggagctgtaatccagagagaaggaaacgtagaaatttattattaaaagaatggactg 

tgcagcgaaatctgtacggttctgtgct^aagaggtgttttgcctvgcctgaactatatto 



FIGURE 151 

MLNSNVLLWLTALAI KFTL I DSQAQY P WNTNYGKI RGLRTPLPNE I LGPVEQYLGVP YAS PPTG 

ERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQNEDCLYL 

NI YVPTEDGANTKKNADD ITSNDRGEDEDI HDQNSKKPVMVY I HGGS YMEGTGNM I DGS I LAS YG 

WIVITIlSnfRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEEWGAFGGDPKRVT 

VS LLTLSH YS EGLFQKAI IQSGTALS S WAVNYQPAKYTR I LADKVGCNMLDTTDMVECLRNKNYK 

ELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGVNQGEGLKFVDGIVDNEDG 

VTPNDFDFS VSNFVDNL YGYPEGKDTLRET I KFMYTDWADKENPETRRKTLVALFTDHQWVAPAV 

AADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYVFGIPMIGPTELFSCNFSKNDVMLS 

AWMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVAWSKYNPKDQLYLHIGLKPRVRDHYRAT 

KVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTSFPYGTRRSPAKIWPTTKRPAITPANNPKHS 

KDPHKTGPEDTTVLIETKRDYSTELSVTIAVGASLLFLNILAFAALYYKKDKRRHETHRRPSPQR 

NTTNDIAHIQNEEIMSLQMKQLEHDHECESLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTIT 

MI PNTLTGMQPLHTFNTFSGGQNSTNLPHGHSTTRV 

Signal sequence : 

amino acids 1-24 



Transmembrane domains : 

amino acids 189-204, 675-692 




FIGURE 152 

GGGAAA GATGG CGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGTCGG 
CTCGGGATGGGTCCAGGATGTTAGTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCACAGC^ 
GTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCCCTACCAGGG 
TGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGATGACCCAGTATA 
TCCGCCTTACCCCAGATATGCAAAGTAAACAGGGTGCCTTGTGGAACCGGGTGCCATGTTTCCTG 
AGAGACTGGGAGTTGCAGGTGCACTTCAAAATCCATGGACAAGGAAAGAAGAATCTGCATGGGGA 
TGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGCCTGTGTTTGGAAACATGGACA 
AATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAATGAGGAGAAGCAGCAAGAGCGGGTA 
TTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCTCAGCTATGATCATGAGCGGGATGGGCG 
GCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCCGCAATCTTCATTACGACACCTTCCTGGTGA 
TTCGCTACGTCAAGAGGCATTTGACGATAATGATGGATATTGATGGCAAGCATGAGTGGAGGGAC 
TGCATTGAAGTGCCCGGAGTCCGCCTGCCCCGCGGCTACTACTTCGGCACCTCCTCCATCACTGG 
GGATCTCTCAGATAATCATGATGTCATTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCC 
CAGAAGAGGAAAAGCTCCATCGAGATGTGTTCTTGCCCTCAGTGGACAATATGAAGCTGCCTGAG 
ATGACAGCTCCACTGCCGCCCCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGT 
GTTTTCTGTATTTGCCATAGTCATTGGTATCATACTCTACAACAAATGGCAGGAACAGAGCCGAA 
AGCGCTTCTACTGAGCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGG 
AGCAGGCACTGGCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACT 
ATATTCTGTCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATC 
TAACTCTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGT 
GCTTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAACA 
GAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTGAATC 
CACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTTTTGCATT 
TGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACTTCTTCCCTGC 
CTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTGGGAAAGGCATTTG 
GATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCACTGGCCTTCATTAGGTG 
GCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCATGGGTCTTGGGTCTATTGGC 
ATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTGAAGTTTGGCTAAAGGTTGGTGT 
AAAAATCAAGAGAAGCCTGGAAGACATCATGGATGCCATGGATTAGCTGTGCAACTGACCAGCTC 
CAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTGGTCTGACCATGTGGAGATGTTTCTGGAC 
TTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGTTACGATTTTTGGAATCCCAeTTTGAGTGCTG 
AAAGTGTAAGGAAGCTTTCTTCTTACACCTTGGGCTTGGATATTGCCCAGAGAAGAAATTTGGCT 
TTTTTTTTCTTAATGGACAAGAGACAGTTGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACC 
CTCATCATCTGTGCCTGGAAGAGTTCACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGT 
CAACCCTTATTCCACTGCCTTATTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTC3GGAT 
TAAATCAGTTACAGGCCAGAGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCT 
GTAGCCTAAATGAAATTCTTAAAATCACCGATGGAACCAAAAAAAAAAAAAAAAAGGGCGGCCGC 
GACTCTAGAGTCGACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 



FIGURE 153 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50911 
xsubunit 1 of 1, 348 aa, 1 stop 
xMW: 39711, pi: 8.70, NX(S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQGVG 

TGSSSLWNLMGNAMWTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQGKKNLHGDGL 

AIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFPYISAMVNNGSLSYDHERTC 

ELGGCTAIVRl^HYDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRGYYFGTSSITGDL 

SDmDVISLKLFELWERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFS 

VFAIVIGIILYNKWQEQSRKRFY 

Signal sequence: 

amino acids 1-38 

Transmembrane domain: 

amino acids 310-329 




FIGURE 154 

CCGAGCCGGGCGCGCAGCGACGGAGCTGGGGCCGGCCTGGGACCATGGGCGTGAGTGCAATCTACGGATCAGTCTCTG 

ATGGTGGGTCGTTAACCTCAGTGGGGACTCCAAGATTTCCATGAAGAAAATCAGTTGTCTTCATTCAAGAATTGGGGT 

CTGGCTCAGAATTCCTGCAGCTGGTGAAAATCTGTTTTCTAGAAGAGGTTTAATTAATGCCTGCAGTCTGACATGTTC 

CCGATTTGAGGTGAAACCATGAAGAGAAAATAGAATACTTAATA ATG CtTTTCCGCAACCGCTTCTTGCTGCTGCTGG 

CCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGATCCCGGTGTCGACTCCTAAGAATG 

GAATGAGTAGCAAGAGTCGAAAGAGAATCATGCCCGACCCTGTGACGGAGCCCCCTGTGACAGACCCCGTTTATGAAG 

CTCTTTTGTACTGCAACATCCCCAGTGTGGCCGAGCGCAGCATGGAAGGTCATGCCCCGCATCATTTTAAGCTGGTCT 

CAGTGCATGTGTTCATTCGCCACGGAGACAGGTACCCACTGTATGTCATTCCCAAAACAAAGCGACCAGAAATTGACT 

GCACTCTGGTGGCTAACAGGAAACCGTATCTVCCCAAAACTGGAAGeTTTCATTAGTCACATC 

CCTCTTTCGAAAGCCCCTTGAACTCCTTGCCTCTTTACCCAAATCACCCATTGTGTGAGATGGGAGAGCT 

CAGGAGTTGTGCAGCATTTGCAGAACGGTCAGCTGCTGAGGGATATCTATCTAAAGAAACACAAACTCCTC 

ATTGGTCTGCAGACCAGCTCTATTTAGAGACCACTGGGAAAAGCCGGACCCTACAAAGTGGGCTGGCCOT 

GCTTTCTCCCAGATTTTGACTGGAAGAAGATTTATTTCAGGCT^CCAGCCAAGTGCGCTGTT^^ 

ATTGCCCGGTAAGT^AACCAGTATCTGGAAAAGGAGCAGCGTCGTCAGTACCTCCTACGTTTGAAAAACAGCCAGCTGG 

AGAAGACCTACGGGGAGATGGCCAAGATCGTGGATGTCCCCACCAAGCAGCTTAGAGCTGCCAACCCCATAC^ 

TGCTCTGCCACTTCTGCCACAATGTCAGCTTTCCCTGTACCAGAAATGGCTGTGT^ 

TTAAGACCCATCAGATCGAGGATGAAAGGGAAAGACGGGAGAAGAAATTGTACTTCGGGTATTCTCTCCTGGGTGCCC 

ACCeCATCCTGAACCAAACCATCGGCCGGATGCAGCGTGCCACCGAGGGCAGGAAAGAAGAGCTCTTTGCCCTCTACT 

CTGCTCATGATGTCACTCTGTCACCAGTTCTCAGTGCCTTGGGCCTTTCAGAAGCCAGGTTCCCAAGGTTTGCAGCCA 

GGTTGATCTTTGAGCTTTGGCAAGACAGAGAAAAGCCCAGTGAACATTCCGTCCGGATTCTTTACAATGGCGTCGATG 

TCACATTCCACACCTCTTTCTGCCAAGACCACCACAAGCGTTCTCCCAAGCCCATGTGCCCGCTTGAAAACTTGGTCC 

GCTTTGTGAAAAGGGACATGTTTGTAGCCCTGGGTGGCAGTGGTACAAATTATTATGATGCATGTCACAGGGAAGGAT 

TC TAAA AGGTATGCAGTACAGCAGTATAGAATCCATGCCAATACAGAGCATAGGGAAAGGTCCACTTCTAGTTTTGTC 

TGTTACTAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCACAGATGGTTGGGGTTGAAC 

AGTAAGCACATTGCTGCAATGTGGTACGTGAATTGCTTGGTACAAAATGGCCAGTTCACAGAGGAATAGAAGGTACTT 

TATCATAGCCAGACTTCGCTTAGAATGCCAGAATAATATAGTTCAAGACCTGAAGTTGCCAATCCAAGTTTGCACTCT 

TCTGGCCTGCCCCATGTTACTATGTGATGGAACCAGCACACCTCAACCAAAATTTTTTTAATCTTAGACATTTTTACC 

TTGTCCTTGTTAAGAATTTCTTGAAGTGATTTATCTAAAATAAAGGTTGGCAAACTTTTTCTGTAAAGGGCCAGATTG 

TAAATATTTCAGACTGTGTGGACCAAAAGGCCACATACAGTCTCTGTCATAACTACTGAACTCTGTTTCTGAAGCAGG 

AAAGCCACCACAGACAGTACATAAAGGAATATGTGTAGCTGGGTTCCCAGGCCAGACAAAACAGATGGTGACCAGACT 

TGGCCCCTGGGCTGTAGTTTGCTGACCCCTCATCTAAAAAATAGGCTATACTACTUVTTGCACTTCCAGCACTTTGA^ 

ACGAGTTGAATACCT^GAATTATTCAATGGTTCCTCCAGTAACTTCTGCTAGAAACACAGAATTTGGTCTGTATCT 

CACTAGAACAAAACTTGAGGGTAAATAAACATTGAATTAGAATGAATCATAGAAAACTGATTAGAAGAATACTTGATG 

TTTATGATGATTGTGGTACAAGATAGTTTTAAGTATGTTCTAAATATTTGTCTGCTGTAGTCT^ 

TGAAATTTTTGTATGCCATTTAGTATTTTTATAGTTTAGGAAAATATTTTCTAAGACCAGTTTTAGATGACTCTTATT 

CCTGTAGTAATATTCAATTTGCTGTACCTGCTTGGTGGTTAGAAGGAGGCTAGAAGATGAATTCAGGCACTTTCOT 

AATATUUVCTAATTATGGCTCATTCCCTTTGACAAGCTGTAGAACTGGATTCATTTTTAAACCATTT^ 

AATGGTAAATTCTGATTGATTTTTAAATGCGTTTTTGGAAGAACTTTGCTATTAGGTAGTTTACAGATCTTTATAAGG 

TGTTTTATATATTAGAAGCAATTATAATTACATCTGTGATTTCTGAACTAATGGTGCTAATTCAGAGAAATGGAAAGT 

GAAAGTGAGATTCTCTGTTGTCATCGGCATTCCAACTTTTTCTCTTTGTTTTTGTCCAGTGTTGCATTTG7VATATGTC 

TGTTTCTATAAATAAATTTTTTAAGAATAA 




FIGURE 155 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48329 
xsubunit 1 of 1, 480 aa, 1 stop 
xMW: 55240, pi: 9.30, NX(S/T): 2 

MLFRNRFLLLLALAALl^FVSLSLQFFHLIPVSTPKNGMSSKSRKRIMPDPWEPPVTDPW 

LYCNIPSVAERSMEGHAPHHFKLVSVHVFIRHGDRYPLWIPKTKRPElbCTLVANRKPYHPKLE 

AFISHMSKGSGASFESPLNSLPLYPNHPLCEMGELTQTGWQHLQNGQLLRDIYLKKHKLLPNDW 

SADQLYLETTGKSRTLQSGLALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCPVRNQYLEKEQRR 

QYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHWSFPCTRNGCVDMEHFKVIK 

THQ I EDERERRE KKLYFGYS LLGAHP I LNQT I GRMQRATEGRKE ELFALYS AHDVTLS PVLS ALG 

LSEARFPRFAARLIFELWQDREKPSEHSVRILYNGVDVTFHTSFCQDHHKRSPKPMCPLENLVRF 

VKRDMFVALGGSGTNYYDACHREGF 

Signal sequence: 
amino acids 1-18 




FIGURE 156 

AAAAAAGCTCACTAAAGTTTCTATTAGAGCGAATACGGTAGATTTCCATCCCCTTTTGAAGAACAGTACTGTGGAGCT 

ATTTAAGAGATAAAAACGAAATATCCTTTCTGGGAGTTCAAGATTGTGCAGTAATTGGTTAGGACTCTGAGCGCCGCT 

GTTGACCAATCGGGGAGAGAAAAGCGGAGATCCTGCTCGCCTTGCACGCGCCTGAAGCACAAAGCAGATAGCTAGGAA 

TG7VACCATCCCTGGGAGTATGTGGAAACAACGGAGGAGCTCTGACTTCCCAACTGTCCCATTCTATGGGCGAAGGAAC 

TGCTCCTGACTTCAGTGGTTAAGGGCAGAATTGAAAATAATTCTGGAGGAAGATAAG AATGA TTCCTGCGCGACTGCA 

CCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGACTCTGTGGGAGACCGGATGCACCCAGATACGCTA 

TTCAGTTCCGGAAGAGCTGGAGAAAGGCTCTAGGGTGGGCGACATCTCCAGGGACCTGGGGCTGGAGCCCCGGGAGCT 

CGCGGAGCGCGGAGTCCGCATCATCCCCAGAGGTAGGACGCAGCTTTTCGCCCTGAATCCGCGCAGCGGCAGCTTGGT 

CACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCCATCAAGTGTCAATTAAATCTAGACATTCTGATGGA 

GGATAAAGTGAAAATATATGGAGTAGAAGTAGAAGTAAGGGACATTAACGACAATGCGCCTTACTTTCGTGAAAGTGA 

ATTAGAAATAAi\AATTAGTGAAAATGCAGCCACTGAGATGCGGTTCCCTCTACCCCACGCCTGGGATC 

GAAGAACTCTCTGCAGAGCTACGAGCTCAGCCCGAACACTCACTTCTCCCTCATCGTGCAAAATGGAGCCGACGGTAG 

TAAGTACCCCGAATTGGTGCTGAAACGCGCCCTGGACCGCGAAGAAAAGGCTGCTCACC7VCCTGGTCCTTACGGCCTC 

CGACGGGGGCGACCCGGTGCGCACy^GGCACCGCGCGCATCCGCGTGATGGTTCTGGATGCGAACGACAACGCACC^ 

GTTTGCTCAGCCCGAGTACCGCGCGAGCGTTCCGGAGT^TCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACGCTAC 

CGACCCTGACGAAGGAGTCTU^TGCGGAAGTGAGGTATTCCTTCCGGTATGTGGACGACAAGGCGGCCCAAGOT 

ACTAGATTGTAATTCAGGGACAATATCAACAATAGGGGAGTTGGACCACGAGGAGTCAGGATTCTACCAGAT 

GCAAGCAATGGATAATGCAGGATATTCTGCGCGAGCCAAAGTCCTGATCACTGTTCTGGACGTGAA 

AGAAGTGGTCCTCACCTCTCTCGCCAGCTCGGTTCCCGAAAACTCTCCCAGAGGGACATTAATTGCCCTTTTAAATGT 

AAATGACCAAGATTCTGAGGAAAACGGACAGGTGATCTGTTTCATCCAAGGAAATCTGCCCTTTAAATTAGAAAAATC 

TTACGGAAATTACTATAGTTTAGTCACAGACATAGTCTTGGATAGGGAACAGGTTCCTAGCTACAACATCACAGTGAC 

CGCCACTGACCGGGGAACCCCGCCCCTATCCACGGAAACTCATATCTCGCTGAACGTGGCAGACACCAACGACAACCC 

GCCGGTCTTCCCTCAGGCCTCCTATTCCGCTTATATCCCAGAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGAC 

CGCCCACGACCCCGACTGTGAAGAGAACGCCCAGATCACTTATTCCCTGGCTGAGAACACCATCCAAGGGGCAAGCCT 

ATCGTCCTACGTGTCCATCAACTCCGACACTGGGGTACTGTATGCGCTGAGCTCCTTCGACTAeGAGCAGTTCCGAGA 

CTTGCAAGTGAAAGTGATGGCGCGGGACAACGGGGACCCGCCCCTCAGCAGCAACGTGTCGTTGAGCCTGTTCGTGCT 

GGACCAGAACGACAATGCGCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGGAGCTGGCTCC 

CCGCTCCGCAGAGCCCGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACAGAGACTCCGGCCAGAACGCCTGGCTGTC 

CTACCGTCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCTGCACACGGGCGAGGTGCGCACGGCGCGAGC 

CCTGCTGGACAGAGACGCGCTCAAGCAGAGCCTCGTAGTGGCCGTCCAGGACCACGGCCAGCCCCCTCTCTCCGCCAC 

TGTCACGCTCACCGTGGCCGTGGCCGACAGCATCCCCCAAGTCCTGGCGGACCTCGGCAGCCTCGAGTCTCCAGCTAA 

CTCTGAAACCTCAGACCTCACTCTGTACCTGGTGGTAGCGGTGGCCGCGGTCTCCTGCGTCTTCCTGGCCTTCGTCAT 

CTTGCTGCTGGCGCTCAGGCTGCGGCGCTGGCACAAGTCACGCCTGCTGGAGGCTTCAGGAGGCGGCTTGACAGGAGC 

GCCGGCGTCGCACTTTGTGGGCGTGGACGGGGTGCAGGCTTTCCTGCAGACCTATTCCCACGAGGTTTCCCTCACCAC 

GGACTCGCGGAAGAGTCACCTGATCTTCCCCCAGCCCAACTATGCAGACATGCTCGTCAGCCAGGAGAGCTTTGAAAA 

AAGCGAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAAAGACAGTCATGGGTTAATTGAGGTGAGTTTATATC^ 

TCAAACTCCTAGGCTCAAGCAATTATCCCACCTTTGCCTCCGGTGTAACAGGGACTACAGGTGCAAGCCACCTAC 

CTGCCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATTACTTTCTTGTACAGACGGGAG 

TCTCACGCCTGTAATCCCAGTACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTTGGGAGTTTGAGACCAGCCTG 

ACCAACATGGAGAAACCCCGTCTATACTAAAAAAATACAAAATTAGCCGGGCGTGGTGGTGCATGTCTGTAATCCCAG 

CTACTTGGGAGGCTGAGTCAGGAGAATTGCTTTAACCTGGGAGGTGGAGGTTGCAATGAGCTGAGATTGTGCCATTC 

ACTCCAGCCTGGGCAACAAGAGTGAAACTCTATCTCA 



FIGURE 157 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA48306 
xsubunit 1 of 1, 916 aa, 1 stop 
xMW: 100204, pi: 4.92, NX (S/T) : 4 

MIPARLHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPRELAERGVR 
ilPRGRTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDILMEDKVKIYGVEVEVRDINDNA 
PYFRESELEIKISENAATEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGADGSKYPELV 
LKRALDREEKAAHHLVLTASDGGDPWTGTARIRVMVLDANDNAPAFAQPEYRASVPENL^ 
. LLVVNATDPDEGVNAEWYSFRYVDDKAAQVFKLDCNSGTISTIGELDHEESGFYQMEVQAMDNA 
GYSARAKVLIWLDWDNAPEVVLTSLASSVPENSPRGTLIALLIWNDQDSEENGQVICFIQGNL 
PFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLSTETHISLNVADTNDNPPVFPQA 
SYSAYIPENNPRGVSLVSVTAHDPDGEENAQITYSLAENTIQGASLSSYVSjNSDTGVLYALSSF 
DYEQFRDLQVKVMARDNGHPPLSSNVSLSLFVLDQNDNAPEILYPALPTDGSTGVELAPRSAEPG 
YLVTKWAVDRDSGQNAWLSYRLLKASEPGLFSVGLHTGEVRTARALLDRDALKQSLWAVQDHG 
QPPLSATVTLWAVADS I PQVIJUDLGSLESPANSETSDLTLYLWAVAAVSCVFIAF^ 
LRRWHKSRLLQASGGGLTGAPASHFVGVDGVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLV 
SQESFEKSEPLLLSGDSVFSKDSHGLIEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLK 
QLSHLCLRCNRDYRCKPPTVCLSTYLSIYLSIYLSIYLLLSCTDGSLTPVIPVLWEAEAGGSPEV 
GSLRPA 

Signal sequence: 

amino acids 1-30 

Transmembrane domains: 

amino acids 693-711, 809-823, 869-888 



FIGURE 158 

CCO^GGCTCTAGTGCT^GGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGT 
CCAGAATCGTGTACCT^GGCAGAGAACTGAAGTACTGGGGCCTCCTCCTVCTQ 

GGTGACCCCGCCCCTGGATTCTGQAAGACCTCAC CATGG GACGCCCCCGACCTCGTGCGGCCAAG 

ACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGCACAGGAG^ 

GGTGCTGGGGGGTCATGAGTGCCT^CCCCATTCGCAGCCTTGGCAGGCGGCCTTGT^ 

AGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTTACAGCTGCCCACTG 

AAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAATAAAGATGGCCCAGAGi^^ 

AATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACTIGCAGCGATGTGGAC^ 

ATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCCCTGGGGTCCAAAGTGAAGCCCATCAGC 

CTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTGCACCGTCTCAGGCTGGGGCACTGTCACCAG 

TCCCCGAGAGAATTTTCCTGACACTCTCAACTGTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGT 

GTGAGGATGCTTACCCGGGGCAGATCACAGATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCT 

GACACGTGCCAGGGCGATTCTGGAGGCCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATC 

CTGGGGCTCAGACCCCTGTGGGAGGTCCGACAAACCTGGCGTCTATACC?U^CATCTG 

TGGACTGGATCAAGAAGATCATAGGCAGCAAGGG CTGA TTCTAGGATAAGCACTAGATCTCCCTT 

AATAAACTCACAACTCTCTGGTTC 



FIGURE 159 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48336 
<subunit 1 of 1, 260 aa, 1 stop 
<MW: 28048, pi: 7.87, NX (S/T) : 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDK\n^GGHECQPHSQPWQAALFQGQQLLCGGV^ 
NWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEI PWQS IPHPCYNSSDVEDHNHDLMLLQLRDQA 
SLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCEDAYPGQI^^ 
IWCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPaSRSDKPGVYTNICR^ 

Important Features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain : 

amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site. 

amino acids 182-188 

Kringle domain proteins motif 

amino acids 205-217 



FIGURE 160 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGCCGC 

GCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGCGCCCGC 

GCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCGCCCCGGCCG 

GCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAACATTCATCCTCCC 

GGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCGCCCTGTGCGCCCTGC 

GCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGCGGAGCGCGCCGAGCCtCG 

TCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGATGCGGACCCGGCCGCGGGGAG 

ACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGCCCCGCCCAACCCCTAC GATGA AGA 

GGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTGCTGTGGCTGCAGGCCTGGCT^GGTGGC^ 

GCCCCATGCCCAGGTGCCTGCGTATGCTACAATGAGCCCAAGGTGACGACAAGCTGCCCCCAGCA 

GGGCCTGCAGGCTGTGCCCGTGGGCATCCCTGCTGCCAGCCAGCGCATCTTCCTGCACGGCAACC 

GCATCTCGCATGTGCCAGCTGCCAGCTTCCGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCAC 

TCGAATGTGCTGGCCCGAATTGATGCGGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGA 

CCTCAGCGATAATGCACAGCTCCGGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTAC 

ACACGCTGCACCTGGACCGCTGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCT 

GCCCTGCAGTACCTCTACCTGCAGGACTVACGCGCTGCAGGCACTGCCTGATGACACCTTC 

CCTGGGCAACCTCACACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCT 

TCCGTGGGCTGCACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCG 

CATGCCTTCCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCT 

GCCCACTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGG 

TGTGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCGTCCTCCGAG 

GTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATGACCT 

GCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACCGATGAGG 

AGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGTACTGGAGCCT 

GGAAGACCAGCTTCGGCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTGACAGCCCGCCGGG 

CAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTGCCTGGCTCTGCTGAGC 

CCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTTCCCCACCTCGGGCCCTCGC 

CGGAGGCCAGGCTGTTO^CGCAAGAACCGCACCCGCAGCC^CTGCCGTCTGGGCCAGGC^ 

CGGGGGTGGCGGGACTGGTGACTCAGAAGGCTCAGGTGCCCTACCCAGCCTCACCTGCAGCCTCA 

CCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCTTGGGCCCTGC TGA CCCCCAGCGGACACA 

AGAGCGTGCTCAGCAGCCAGGTGTGTGTACATACGGGGTCTCTCTCCACGCCGCCAAGCCAGCCG 

GGCGGCCGACCCGTGGGGCAGGCCAGGCCAGGTCCTCCCTGATGGACGGCTGCCGCCCGCCACCC 

CCATCTCCACCCCATCATGTTTACAGGGTTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTCCCA 

CCCAGATCGCGGTATATAGAGATATGCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGG 

AATAAAGAGCTCTTTTCTTAAAAAAA 



FIGURE 161 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44184 
xsubunit 1 of 1, 473 aa, 1 stop 
xMW: 50708, pi: 9.28, NX(S/T): 6 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQ 
GNRISHVPAASFRACRNLTILWLHSNVIJ^IDAAAFTGLALLEQLDLSDNAQLRSVDPATFHGLG 
-RLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLHGNRISSVPE 
RAFRGLHSLDRLLLHQNRVAPIVHPHAFRDLGRLMTLYLFANNLSALPTEALAPLRALQYLRLNDN 
PWCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCAVATGPYHPIWTGRAT 
DEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGNGSGPRHINDSPFGTLPGS 
AEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQAGSGGGGTGDSEGSGALPSLTC 
SLTPLGIiALVLWTVLGPC 

Important features : 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 13 5.-156 

Glycosamlnoglycan attachment site. 

amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 
VWFC domain 

amino acids 411-425 * 




FIGURE 162 

GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTACTCA 
CTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTACAGTCCC 
ACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACA ATGG AGGCCAGCGGGAAGCTCATTT 
GCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCGGGCGCGGCGGAA 
CCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCACCAATTTAGCAAAGGA 
CCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTGTTTCCAGAGGGAACAAAC 
TACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAATGAGAAATTGGACCGTGAGGAT 
CTGTGCGGTCACACAGAGCCCTGTGTGCTACGTTTCCAAGTGTTGCTAGAGAGTCCCTTCGAGTT 
TTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACCACTCTCCAGTATTTCTGGACAAACAAA 
TGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGGACTACGTTTCCTCTGAAGAATGCCGAAGAC 
TTAGATGTAGGCCAAAACAATATTGAGAACTATATAATCAGCCCOy^CTCCTATTTTCGGGTCCT 
CACCCGCAAACGCAGTGATGGCAGGAAATACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAG 
AGGAAGAAGCTGAGCTCAGGTTAACACTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGC 
ACTGCTCAGGTCTACATCGAAGTCCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTT 
CTATAGAGTGCAGATCTCTGAGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGG 
ATGTAGACACAGGAGTCAACGGAGAGATTTCCTATTCACTTTTCCAAGCTTCAGAAGAGATTGGC 
AAAACCTTTAAGATCTU^LTCCCTTGACAGGAGAAATTGAACTAAAAAAACT^ 

ACTTCAGTCCTATGAAGTCAATATTGAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCG 

TTCTGATTCAAGTGATAGATGTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGC 

CCAATACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGG 

AGAAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAAA 

ACTTTTACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACATCACT 

ATCACTGTCACTGACTTGGGGACCGCTATGCTGATAACACAGCTCAATATGACCGTGCTGATCGC 

CGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTCCGCGAGAACA 

ACAGCCCCGCCCTGCACATCCGCTVGCGTCAGCGCTACAGACAGAGACTCAGGCACCT^ 

GTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACATCCCTGGTCTCCATCAA 

CGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAGGCCCTGCAGGGGTTCCAGT 

TCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCAGCGAGGCGCTGGTGCGCGTGGTG 

GTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACCCGCTGCAGAACGGCTCCGCGCCCTG 

CACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACG 

GCGACTCGGGCCAGAACGCCTGGCTGTCGTACCAGCTGCTCAAGGCCACGGAGCTCGGTCTGTTC 

GGCGTGTGGGCGCACAATGGCGAGGTGCGCACCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAA 

GCACAGGCTGGTGGTGCTGGTCAAGGACAATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGC 

ACGTGCTCCTGGTGGACGGCTTCTCCCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAG 

GCCCAGGCCGACTTGCTCAeCGTCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCT 

CTTTTCGGTGCTCCTGTTCGTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTC 

GCTGCTTGGTGCCCGAGGGCCCCCTTCCAGGGCATCTTGTGGACATGAGCGGCACCAGGACCCTA 

TCCCAGAGCTACCAGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCT 

GAAGCCGATTATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCT 

TCCCCAATAACTTTGGGTTCAATATTCAGTGACCATAGTTGACTTTTACATTCCATAGGTATTTT 

ATTTTGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGA 

TTTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATTCC 

TGGTTCTT 



FIGURE 163 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48314 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pi: 4.84, NX(S/T): 5 

MEASGKLI CRQRQVLFS FLLLGLSLAGAAEPRSYSVVEETEGSS FVTNLAKDLGLEQREFSRRGV 

RWSRGNKLHLQLNQETADLLLNEKLDREDLCGHTEPCVLRFQVLLES PFEFFQAELQVI DINDH 

SPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKRSDGRKYPEL 

VLDKALDREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFYRVQISEDSPVGF 

LWKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTGEIELKKQLDFEKLQSYEVNIEARDA 

GTFSGKCTVLIQVIDVNDHAPEVTMSAFTSPIPENAPETWALFSVSDLDSGENGKISCSIQEDL 

PFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQLNMTVLIADWDN^ 

YTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDPHLPLTSLVSINADNGHLFALRSLD 

YEALQGFQFRVGASDHGSPALSSEALVRWVLDANDNSPFVLYPLQNGSAPCTELVPRAAEPGYL 

VTKWAVDGDSGQNAWLSYQLLKATELGLFGWAHNGEWTARLLSERDAAKHRLVVLVKDN^ 

PRSATATLHVLLVDGFSQPYLPLPEAAPTQAQADLLTVYLVVAIiASVSSLFLFSVLLFVAVRLCR 

RSRAASVGRCLVPEGPLPGHLVDMSGTRTLSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPG 

KE IQGNSTFPNNFGFNIQ 

Important features s 
Signal peptide: 

amino acids 1-26 

Tramsmembrane domain : 

amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP -binding site motif A (P-loop) . 

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 



FIGURE 164 

ACCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGCGCC 

GATTGCCTCTCGGCCTGGGC AATGG TGCCGGCTGCCGGTCGACGACCGCCCCGCGTCATGCGGCT 

CCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGGAGGTTGCA 

AGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTGGGGGGTGTGTAC 

CTGGGTGAGGAGGAGCTCCTGCATGACCCGATGGGCCTVGGACAGGGCAGCAGAAGAGGCC^ 

GGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTGTGATTCCTGGOT 

AGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCTGGAGGAGCGGAGGACTCAAC^ 

TGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGGTGGAGC:i?^CACTTCCCTGACAGAG^ 

GGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACGCAGCCCCGACAGAGGACTCCAATAACA 

CTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAGGAGAGAAACATTACAGGATTAGAAAATTTC 

ACTCTGAAAATTTTAAATATGTCACAGGACCTTATGGATTTTCTGAACCCAAACGGTAGTGACTG 

TACTCTAGTCCTGTTTTACACCCCGTGGTGCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACT 

CTCTGCCCCGGGCATTTCCAGCTCTTCACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTT 

TCTACCAGGTTTGGCACCGTAGCTGTTCCTAATATTTTATTATTTCAAGGAGCTAAACCAATGGC 

CAGATTTAATCATACAGATCGAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTA 

TAGAAGCCAAGAAGAATGTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTG 

ATAAAAAGTGTGGACTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGC 

TACCATTCGAACTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGA GTAGT 

GATGGTCTGAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTT 

TCATACATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATT^ 

AACAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGCA 

AAAATATTCAATAG 




FIGURE 165 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48333 
xsubunit 1 of 1, 360 aa, 1 stop 
xMW: 39885, pi: 4.79, NX(S/T): 7 

^^PAAGRRPPRVMRLLGWWQVLLVmJGLPVRGVEVAEESGRLWSEEQPAHPLQV^ 

HDPMGQDRAAEEANAVLGLDTQGDIMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRCN^ 

FSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLENFTLKILNM 

SQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFIiALDASQHSSLSTRFGTV 

AVPNI LLFQGAKPMARFNHTDRTLETLKI F I FNQTG I EAKKNVWTQADQ IGPLPSTL I KSVDWL 

LVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features : 
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 321-340 

Homologous region to dilsuf ide isomerase 

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thloredoxin domain 

amino acids 211-227 



FIGURE 166 

CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCAC GATG CTGCAGGGCCCTGGCTCGC 
TGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTCTTTGGC 
CAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGC^ 
CGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCATGAAGGAGGTGC 
TGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCGGACACC^ 
CTGtGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGACCATCCAGCCATGCCACTC 
GCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCGCCTTCGGCTTCCCCTGGCCCG 
ACATGCTTGAGTGCGAGCGTTTCCCCCAGGACAACGACCTTTGCATCCCCCTCGCTAGCAGCGAC 
CACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATGTGAAGCCTGCAAAAATAAAAATGATGA 
TGACAACGACATAATGGAAACGCTTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGA 
TAACCTACATCAACCGAGATACCAAAATCATCCTGGAGACCAAGAGCAAGACCATTTACAAGCTG 
AACGGTGTGTCCGAAAGGGACCTGAAGAAATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCAC 
CTGTGAGGAGATGAACGACATCAACGCGCCGTATCTGGTCATGGGACAGAAACAGGGTGGGGAGC 
TGGTGATCACCTCGGTGAAGCGGTGGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGC 
ATCCGCAAGCTGCAGTGC TAG TCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGG 
CTGACCATTTCTGCTCCGGGATCTCAGCTGCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCT 
. CAGCCTGGGCAGCTTCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGG 
CCACAGGAGTGGATAGCTGTTTTCACCTAAAGGAAAAGCCCACCGGAATCTTGTAGAAATATTCA 
AACTAATAAAATCATGAATATTTTAA 



FIGURE 167 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA50920 
xsubunit 1 of 1, 295 aa, 1 stop 
xMW: 33518, pi: 7,74, NX(S/T): 0 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPNLLG 
HETMKEVLEQAGAWIPLWKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSLCVQVKDRCAPVMS 
AFGFPWPDMLECDRFPQDNDLCIPIASSDHLLPATEEAPKVCEACKNKNDDDNDIMETLCKN 
LKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSLQCTCEEMNDINAPYLVM 
GQKQGGELVI TS VKRWQKGQREFKRI SRS I RKLQC 

Important features : 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus 

amino acids 6-153 



FIGURE 168 



GTGGAGGCCGCCGAC GATGG CGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGGCCC 

TGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATCAGCCTG 

CTCAACTGCTCCAACGCCTVCGCTGTGGCTCTVGCTTTGCACCTGTGGCTGAC^^ 

CTTGGTCCTGTCCATGGAGCT^GATCAACTGGCTGTaVCTGGTCTACCTCGTGGTATCC^ 

TTGGCGTGGCGGCCT^TCTGGATCCTGGACTCCGTCGGGCTCCGTGCGGCGACCT^TCCTGGGTGCG 

TGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGTTGTTGGGACCCAAAACCC 

ATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCCAGAGCCTGGTCATCTTCTCTC 

CAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCT^GCGAGCCACGGCCA^ 

ATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCTGTCCCCTGTGCTGGTCAAGAAGGGTGA 

GGACATTCCGTTAATGCTCGGTGTCTATACCATCCCTGCTGGCGTCGTCTGCCTGCTGTCCACCA 

TCTGCCTGTGGGAGAGTGTGCCCCCCACCCCGCCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAG 

AAGTTCCTGGATGGGCTCAAGCTGCAGCTCATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTG 

CTTGGGGGGAATGATCGGGATCTCTGCCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAA 

GCGGCCACTCCAGTGGGTTTTCCGGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGG 

GCACTGGCTCTCGGCCCCTATGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCT 

GTGCCTGTTCTCTCTGGCCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTG 

CCCTGGCTGCCACCTGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAG 

TTGGCGGTCGAGTGTTCCTTCCCCGTGGGGGAGGGGGCTGCCACAGGCATGATCTTTGTGCTG^ 

GCAGGCCGAGGGAATACTO^TCATGCTGGCAATGACGGCACTGACTGTGCC^ 

CCTTGTCCACCTGCCAGCAGGGGGAGGATCCACTTGACTGGACAGTGTCTCTGCTGCTGATGGCC 

GGCCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGCA 

GGCCGAGTCTGGGGAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCGGGTG 

TGGACCGAGGGGGAGCAGGAAGGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGAGTG^ 

GCGAGGGGGGCCTCGCTAGAGGACCCCAGAGGGCCCGGGAGCCCCCACCCAGCCTGCCACCGAGC 

GACTCCCCGTGCGCAAGGCCCAGCTVGCCTVCCGACGCGCCCTCCCGCCCCGGCTVGACTCGCA 

GGGTCCAAGCGTCCAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTTCTCCTCCCCGTGG^ 

ATCAC GTAG CTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGATGGAGGCGAACTGGAACAT 

CTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCTCCGGGAGCGAATTACT^GCGCG 

CACCTGAAAA 



FIGURE 169 



>< /us r / s eqdb2 / s s t /DNA/Dnas eqs . min/ s s . DNA5 0 9 8 8 
xsubunit 1 of 1, 560 aa, 1 stop 
xMW: 58427, pi: 6.86, NX(S/T): 2 

MAGPTEAETGIAEPRALCAQRGHRTYARRWFLIAISLLNCSNATLWLSFAPVADVIAEDLVLSM 

EQINWLSLVYLWSTPFGVAAIWILDSVGLRAATILGAWLNFAGSVLRIWPCMWGTQNPFAFLM 

GGQSLCALAQSLVI FS PAKLAALWFPEHQRATANMLATMSNPLGVLVANVLS PVLVKKGEDI PLM 

LGVYTIPAGWCLLSTICLWESVPPTPPSAGAT^STSEKFLDGLKLQLMWNKAYVILAVCLGGMI 

GISASFSALLEQILCASGHSSGFSGLCGALFITFGILGALALGPYVDRTKHFTEATKIGLCLFSL 

ACVPFALVSQLQGQTLALAATCSLLGLFGFSVGPVAMELAVECSFPVGEGAATGMIFV^ 

LIMLAMTALTWRSEPSLSTCQQGEDPLDWTVSLLLMAGLCTFFSCILAVFFHTPYRRLQAESGE 

PPSTRNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARGASLEDPRGPGSPHPACHRATPRAQ 

GPAATDAPSRPGRIiAGRVQASRF I D PAGSH S S FS S PWVI T 

Important features: 
Signal peptide: 

amino acids 1-44 

Transmembrane domains : 

amino acids 61-79, 98-112, 126-146, 169-182, 201-215, 248-268, 
280-300, 318-337, 341-357, 375-387, 420-441 

N-glycosylation site • 

amino acids 40-43 and 43-46 

Glycosaminoglycan attachment site. 

amino acids 468-471 




FIGURE 170 

GTCCCACATCCTGCTCAACTGGGTCAGGTCCCTCTTAGACCAGCTCTTGTCCATCATTTGCTGAAGTGGACCAACTAG 

TTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGGACATCTCAGATCGCTTCCAATGAAGATGGCCT 

TGCCTTGGGGTCCTGCTTGTTTCATAATCATCTAACTATGGGACAAGGTTGTGCCGGCAGCTCTGGGGGAAGGAGCAC 

GGGGCTGATCAAGCCATCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACTCTAGTGGTTTCTGAATCTAGC 

CCACTTGGCGGTAAGC ATGA TGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGGTGGCTACTTATTTCTTTT 

AGGGGATTGTCAGGAGGTGACCTVCTCTCACGGTGAAATACCAAGTGTCAGAGGAAGTGCCATCTGGTAC^ 

GAAGCTGTCCCAGGAACTGGGCCGGGAGGAGAGGCGGAGGCAAGCTGGGGCCGCCTTCCAGGTGTTGCAGCTC 

GGCGCTCCCCATTCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCGGCTGGATCGAGAGCA 

ACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTTGGCTCTGATCCATGT 

G.CTGGACATCAATGACCACCAGCCACGGTTTCCCAAAGGCGAGCAGGAGCTGGAAATCTCTGAGAGCGCCTCTCTGCG 

AACCCGGATCCCCCTGGACAGAGCTCTTGACCCAGACACAGGCCCTAACACCCTGCACACCTACACTCTGTCTCCCAG 

TGAGCACTTTGCCTTGGATGTCATTGTGGGCCCTGATGAGACCAAACATGCAGAACTCATAGT^ 

CAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGCCTATGACAATGGGAACCCCCCCAAGTCAGGTACCAGCTT 

GGTCAAGGTCAACGTCTTGGACTCCAATGACAATAGCCCTGCGTTTGCTGAGAGTTCACTGGCACTGGTyUVTCCAAGA 

AGATGCTGCACCTGGTACGCTTCTCATAAAACTGACCGCCACAGACCCTGACCAAGGCCCCAATGGGGAGGTGGAGTT 

CTTCCTCAGTAAGCACATGCCTCCAGAGGTGCTGGACACCTTCAGTATTGATGCCAAGACAGGCGAGGTCATTCTGCG 

TCGACCTCTAGACTATGAAAAGAACCCTGCCTACGAGGTGGATGTTCAGGCAAGGGACCTGGGTCCCAATCCTATCCC 

AGCCCATTGCAAAGTTGTCATCAAGGTTCTGGATGTCAATGACAACATCCCAAGCATCCACGTCACATGGGCCTCCCA 

GCCATCACTGGTGTCAGAAGCTCTTCCCAAGGACAGTTTTATTGCTCTTGTCATGGCAGATGACTTGGATTCAGGACA 

CAATGGTTTGGTCCACTGCTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCAACACATACAT 

GTTGCTAACCAATGCCACACTGGACAGAGAGCAGTGGCCCAAATATACCCTCACTCTGTTAGCCCAAGACCAAGGACT 

CCAGCCCTTATCAGCCAAGAAACAGCTCAGCATTCAGATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAG 

CAGGTATGAAGTCTCCACGCGGGAAAACAACTTACCCTCTCTTCACCTCATTACCATCAAGGCTCATGATGCAGACTT 

GGGCATTAATGGAAAAGTCTCATACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGACTCCAACACAGG 

AGAGGTCACTGCTCAGAGGTCACTGAACTATGAAGAGATGGCCGGCTTTGAGTTCCAGGTGATCGCAGAGGACAGCGG 

GCAACCCATGCTTGCATCCAGTGTCTCTGTGTGGGTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGTCCA 

GCCTGTGCTCAGCGATGGAAAAGCCAGCCTCTCCGTGCTTGTG7VATGCCTCCACAGGCCACCTGCTGGTGCCCATCGA 

GACTCCCAATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTCACAGCTCCCGGCCATTCCTTTTGAC 

T^CCATTGTGGCAAGAGATGCAGACTCGGGGGCAAATGGAGAGCCCCTCTACAGCATCCGCAATGGAAATGAAGCCCA 

CCTCTTCATCCTO^CCCTCATACGGGGCAGCTGTTCGTCAATGTCACCAATGCCAGCyVGCCTCATTGGGA^^ 

GGAGCTGGAGATAGTAGTAGAGGACCAGGGAAGCCCCCCCTTACAGACCCGAGCCCTGTTGAGGGTCATGTTTGTCAC 

CAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGATGCTGACGGTGATC 

TGTACTGTTGGGCATCTTCGGGTTGATCCTGGCTTTGTTCATGTCCATCTGCCGGACAGAAAAGAAGGACAACAC^ 

CTACAACTGTCGGGAGGCCGAGTCCACCTACCGCCAGCAGCCCAAGAGGCCCC7VGAAACACATTCAGAAGGCAGACAT 

CCACCTCGTGCCTGTGCTCAGGGGTCAGGCAGGTGAGCCTTGTGAAGTCGGGCAGTCCCACAAAGATGTGG^ 

GGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTTCCACCTCACCCCGACCCTGTACAGGACGCTGCG 

TAATCTlAGGC^CCAGGGAGCACCGGCGGAGAGCCGAGAGGTGCTGCAAGACT^CGGTCAACCTCCTTTTCi^ 

CAGGCAGAGGAATGCCTCCCGGGAGAACCTGAACCTTCCCGAGCCCCAGCCTGCCACAGGCCAGCCACGTTCCAGGCC 

TCTGAAGGTTGCAGGCAGCCCCACAGGGAGGCTGGCTGGAGACCAGGGCAGTGAGGAAGCCCCACAGAGGCCACCAGC 

CTCCTCTGCAACCCTGAGACGGCAGCGACATCTCAATGGCAAAGTGTCCCCTGAGAAAGAATCAGGGCCCCGTCAGAT 

CCTGCGGAGCCTGGTCCGGCTGTCTGTGGCTGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCC 

TCCTGTTCAGCAAATCTCCCAGCTGCTGTCCTTGCTGCATCAGGGCCAATTCCAGCCCAAACCAAACCACCGAGGAAA 

TAAGTACTTGGCCAAGCCAGGAGGCAGCAGGAGTGCAATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGGAGGCCA 

GACAGACCCAGAACAGGAGGAAGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCT 

GTCAAGTCTGCTGGACCCCAGCACAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACT 

CTCTTTGCCCCTCACCACCAACTACCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTT 

CCAGACGTTCGGCAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTCTCGGAGAT 

GAGCTCACTGCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCT 

CTCGGTCTGCGGGAGGACCCTCAGTTTAGACTTGGCCACCAGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGG 

TGGAAAGACGGGGACTGAGGGCAAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTG TGAA CATACCTCAGACGCCT 

CTGGATCCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCACTAGCTAGCGG 

CGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAGCCCCAGGACTAACAGCTGACTGACCA 

AAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGAGGACAGGGACGGTTTGTGGCTGAGATAAGTGTTTTC 

AACATATGTGGAGCACAAAGGGTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACAGGCAC^^ 

TCTTGGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTATCAATAAAGGAAA^ 

AGTAAAAAAAAAAAAAAAAAAAA 



FIGURE 171 



</usr/seqdb2/sst/DNA/Dnaseqs -min/ss .DNA48331 
<subunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX(S/T): 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQAGAA 

FQVLQL PQAL P I QVDS EEGLLSTGRRLDREQLCRQWDPCLVS FDVLATGDLAL I HVE I QVLD IND 

HQPRFPKGEQELEI SESASLRTRI PLDRALDPDTGPNTLHTYTLSPSEHFALDVI VGPDETKHAE 

LIWKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSimNSPAFAESSIiALEIQEDAAPG 

TLLI KLTATDPDQGPNGEVEFFLSKHMPPEVLDTFS IDAKTGQVILRRPLDYEKNPAYEVDVQAR 

DLGPNPIPAHCKVLIKVLDWDNIPSIHVTWASQPSLVSEALPKI^SFIALVMAD 

CWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLxAQDQGLQPLSAKKQLSIQISDIND 

NAPVFEKSRYEVSTRElsnsrLPSLHLITIKAHDADLGINGKVSYRIQDSPVAHLVAIDSNTGEVTAQ 

RSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDANDNAPEWQPVLSDGKASLSVLVNAST 

GHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTTIVARDADSGANGEPLYSIRNGNEAHLFILN 

PHTGQLFVlSr^NASSL IGSEWELE I WEDQGS PPLQTRALLRVMFVTSVDHLRDSA^ 

MLWICLAVLLGIFGLIIiALFMSICRTEKKDNRAYNCREAESTYRQQPKRPQKHIQKADIH 

LRGQAGEPCEVGQSHKDVDKEAMMEAGWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDT 

VNLLFNHPRQRNASRENLNLPEPQPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATL 

RRQRHLNGKVSPEKESGPRQILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQ 

PKPNHRGNKYLAKPGGSRSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLD 

PSTGLALDRLSAPDPAWMARLSLPLTTSr^RDNVI S PDAAATEEPRTFQTFGKAEAPELS PTG 

ASTFVSEMSSLLEMLLEQRSSMPVEAASEALRRLSyCGRTLSLDLATSAASGMKVQGDPGGKTGT 

EGKSRGSSSSSRCL 

Important features: 
Signal peptide: 
amino acids 1-13 
Transmembrane domain: 
amino acids 719-739 
N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 
Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 



FIGURE 172 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAG 

ACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGT^^ 

GCCTTCCTTACACTTCGCC ATGA GTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATAC 

TATTTTTTGGATTTGGGTGGCTTTTCTTCATGGGCC^TTGTTTAAAGACTATGAGATACGT(^ 

TATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTC^ 

CTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTG 

TAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATA^ 

CGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTT 

CTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGC 

TCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTC 

AACTGCCCATACACTTACATGTCTTACTTCCTCAGGAATGTGACTGACACGGATATTCTAGCCCT 

GGAACGGCGACTGCTGCAAACCATGGATATGATCATAAGCAAAAAGAAAAGGATGGC?U^TGG^ 

GGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACAAACCATCAGGTTTCTGGGG^ 

AGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTT 

GGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAG 

AATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGT 

GTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGT 

CACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCC 

AACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACT 

CTtACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGC 

ACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAAT 

ACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTGAT 

GTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACC 

AGAGAAGCAAATGGCTICC TTGAA CTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAA 

-TTTAGATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC^^ 

TGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAG 

CATCAGCCAGAACATGAGAGGGAGAACTAACTCT^GACAATACTCAGCAGAGAGCATCC^ 

GATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTG^^ 

CTGGGGO^GACATGTCTATCXSTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAA 

GGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAA 

AAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAAC 

TTGTTTATTGCAGCTTATAATG 



FIGURE 173 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FP I LS PKHG I LS I EQL I SRVGVIGVTLMALLSGFGAVNCP YTYMS YFLRNVTDTDI LALERRLLQ 
TMDMI I SKKKRMAMARRTMFQKGEVHNKPSGFWGMI KS VTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVlSr^LGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLIA 
FVSSVLLIRMSMPLEYRTI ITEVLGELQFNFYHRWFDVI FLVSALSS ILFLYLAHKQAPEKQMAP 

Important features : 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



FIGURE 174 



CATGGGAAGTGKSAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGC^ 

TGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCT^TC 

CTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTTCTTGCACCA 

TGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGTTATTTTCACTGG 

AAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTTTTACATTGGCTATTT 



CCTTTATGTATTTCCAG 



FIGURE 175 

GTGTTGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGT^ 

ATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTTOT 

GTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAGTATTG^ 

GCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCAT^ 

CCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTT 

TTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTlSrrGGAAANTAGGAGATCCCTTTCCC^ 

TC 




FIGURE 17 6 

CTCGCGCAGGGATCGTCCC ATG GCCGGGGCTCGGAGCCGCGACCCTTGGGGGGCCTCCGGGATTTGCTACCTTTTTGG 

CTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAATCTGGACGTGATGGGTGCCTTGCGCAAGGAGGG 

CGAGCCAGGCAGCCTCTTCGGCTTCTCTGTGGCCCTGCACCGGCAGTTGCAGCCCCGACCCCAGAGCTGGCTGCTGGT 

GGGTGCTCCCCAGGCCCTGGCTCTTCCTGGGCAGCAGGCGAATCGCACTGGAGGCCTCTTCGCTTGCCCGTTGAGCCT 

GGAGGAGACTGACTGCTACAGAGTGGACATCGACCAGGGAGCTGATATGCAAAAGGAAAGCAAGGAGAACCAGTGGTT 

GGGAGTCAGTGTTCGGAGCCAGGGGCCTGGGGGCAAGATTGTTACCTGTGCACACCGATATGAGGCAAGGCAGCGAGT 

GGACCAGATCCTGGAGACGCGGGATATGATTGGTCGCTGCTTTGTGCTCAGCCAGGACCTGGCCATCCGGGATGAGTT 

GGATGGTGGGGAATGGAAGTTCTGTGAGGGACGCCCCCT^GGCCATGAACAATTTGGGTTCTGCCAGCy^GC^ 

TGCCGCCTTCTCCCCTGATAGCCACTACCTCCTCTTTGGGGCCCCAGGAACCTATAATTGGAAGGGCACGGC^ 

GGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGACGACGGTCCCTACGAGGC<k3GGGG^ 

GGACCCCCGCCTCATCCCGGTCCCTGCCAACAGCTACTTTGGCTTCTCTATTGACTCGGGGAAAGGTCTGGTGCGTGC 

AGAAGAGCTGAGCTTTGTGGCTGGAGCCCCCCGCGCCAACCACAAGGGTGCTGTGGTCATCCTGCGCAAGG^ 

CAGTCGCCTGGTGCCCGAGGTTATGCTGTCTGGGGAGCGCCTGACCTCCGGCTTTGGCTACTCACTGGCTGTGGCTGA 

CCTCAACAGTGATGGCTGGCCAGACCTGATAGTGGGTGCCCCCTACTTCTTTGAGCGCCT^GAAGAGCTQ 

TGTGTATGTGTACTTGT^CCAGGGGGGTCACTGGGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCAT 

GTTCGGGATCAGCCTGGCTGTCCTGGGGGACCTCAACCAAGATGGCTTTCCAGATATTGCAGTGGGTGCCCCCTTTGA 

TGGTGATGGGAAAGTCTTCATCTACCATGGGAGCAGCCTGGGGGTTGTCGCCAAACCTTCACAGGTGCTGGAGGGCGA 

GGCTGTGGGCATCAAGAGCTTCGGCTACTCCCTGTCAGGCAGCTTGGATATGGATGGGAACCAATACCCTGACCTGCT 

GGTGGGCTCCCTGGCTGACACCGCAGTGCTCTTCAGGGCCAGACCCATCCTCCATGTCTCCCATGAGGTCTCTATTGC 

TCCACGAAGCATCGACCTGGAGCAGCCCAACTGTGCTGGCGGCCACTCGGTCTGTGTGGACCTAAGGGTCTGTTTCAG 

CTACATTGCAGTCCCCAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTGTTAGATGCGGACACAGACCGGAGGCT 

CCGGGGCCAGGTTCCCCGTGTGACGTTCCTGAGCCGTAACCTGGAAGAACCCAAGCACCAGGCCTCGGGCACCGTGTG 

GCTGAAGCACCAGCATGACCGAGTCTGTGGAGACGCCATGTTCCAGCTCCAGGAAAATGTCAAAGACAAGCTTCGGGC 

CATTGTAGTGACCTTGTCCTACAGTCTCCAGACCCCTCGGCTCCGGCGACAGGCTCCTGGCCAGGGGCTGCCTCCAGT 

GGCCCCCATCCTCAATGCCCACCAGCCCAGCACCCAGCGGGCAGAGATCCACTTCCTGAAGCAAGGCTGTGGTGAAGA 

CAAGATCTGCCAGAGCAATCTGCAGCTGGTCCACGCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTCCAACCTCT 

GCCCATGGATGTGGATGGAACAACAGCCCTGTTTGCACTGAGTGGGCAGCCAGTCATTGGCCTGGAGCTGATGGTCAC 

CAACCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGATGATGCCCATGAAGCCCAGCTCCTGGTCATGCTTCC 

TGACTCACTGCACTACTCAGGGGTCCGGGCCCTGGACCCTGCGGAGAAGCCACTCTGCCTGTCCAATGAGAATGCCTC 

CCATGTTGAGTGTGAGCTGGGGAACCCCATGAAGAGAGGTGCCCAGGTCACCTTCTACCTCATCCTTAGCACCTCCGG 

GATCAGCATTGAGACCACGGAACTGGAGGTAGAGCTGCTGTTGGCCACGATCAGTGAGCAGGAGCTGCATCCAGTCTC 

TGCACGAGCCCGTGTCTTCATTGAGCTGCCACTGTCCATTGCAGGAATGGCCATTCCCCAGCAACTCTTCTTCTCTGG 

TGTGGTGAGGGGCGAGAGAGCCATGCAGTCTGAGCGGGATGTGGGCAGCAAGGTCAAGTATGAGGTCACGG^ 

CCAAGGCCAGTCGCTCAGAACCCTGGGCTCTGCCTTCGTCT^CATCyVTGTGGCCTCATGAGA . 

GTTGCTGTACCCTUVTGCAGGTTGAGCTGGAGGGCGGGCAGGGGCCTGGGCAGAAAGGGCTT^ 

CATCCTCCACCTGGATGTGGACAGTAGGGATAGGAGGCGGCGGGAGCTGGAGCCACCTGAGCAGCAGGAGCCTGGTGA 
GCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGCTGAGAAGAAGAAAAACATCACCCTGGACTGCGCCCG 
GGGCACGGCCAACTGTGTGGTGTTCT^GCTGCCCACTCTACAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGCCG 
TCTCTGGAACAGCACCTTTCTGGAGGAGTACTCAGCTGTGAAGTCCCTGGAAGTGATTGTCCGGGCCAACATCACAGT 
GAAGTCCTCCATAAAGAACTTGATGCTCCGAGATGCCTCCT^CAGTGATCCCAGTGATGGTATACTTGGACCCCATGGC 
TGTGGTGGCAGAAGGAGTGCCCTGGTGGGTCATCCTCCTGGCTGTACTGGCTGGGCTGCTGGtGCTAGCACTGCTGGT 
GCTGCTCCTGTGGAAGATGGGATTCTTCAAACGGGCGAAGCACCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAA 
GATTCCTCGGGAAGACCGACAGCAGTTCAAGGAGGAGAAGACGGGCACCATCCTGAGGAACAACTGGGGCAGCCCCCG 
GCGGGAGGGCCCGGATGCACACCCCATCCTGGCTGCTGACGGGCATCCCGAGCTGGGCCCCGATGGGCATCCAGGGCC 
AGGCACCGCC TAG GTTCCCATGTCCCAGCCTGGCCTGTGGCTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGA 
TGAAGAGGGTAGAGTGGGCTGCTGGTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCAGGGGCACAGACCTCTCCC 
ACCCACAAGAACTCCTCCCACCCAACTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGACAGGGCCAT 
GGGGTAGQGTGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGATCCTAATCCCTTCCTCTCCCATTCA 
CCCTGTGTAACAGGACCCCAAGGACCTGCCTCCCCGGAAGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCAC 
TGACTCAGGCTGCTCCTTCTCTAGTTTCCCCTCTCATCTGACCTTAGTTTGCTGCCATCAGTCTAGTGGTTTCGTGGT 
TTCGTCTATTTATT7U\AAAATATTTGAGAACAAAAAAAAAAAAA7^AAAAAA 



FIGURE 177 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA55737 
xsubunit 1 of 1, 1141 aa, 1 stop 
xMW: 124671, pi: 5.82, NX(S/T): 5 

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQLQPR 

PQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGADMQKESKENQWLGVSVRS 

QGPGGKIVTCAHRYEARQRVDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEGRPQGHEQFG 

FCQQGTAAAFSPDSHYLLFGAPGTYNWKGTARVELCAQGSADLAHLDDGPYEAGGEKEQDPRLIP 

VPANSYFGFSIDSGKGLVRAEELSFVAGAPRANHKGAWILRKDSASRLVPEVMLSGERLTSGFG 

YSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVYVYLNQGGHWAGISPLRLCGSPDSMFGISIiA 

VLGDLNQDGFPDIAVGAPFDGDGKVFIYHGSSLGWAKPSQVLEGEAVGIKSFGYSLSGSLDMDG 

NQYPDLLVGSLADTAVLFRARPILHVSHEVSIAPRSIDLEQPNCAGGHSVCVDLRVCFSYIAVPS 

SYSPWALDYVLDADTDRRLRGQVPRVTFLSRNLEEPKHQASGTVWLKHQHDRVCGDAMFQLQEN 

VKDKLRAIWTLSYSLQTPRLRRQAPGQGLPPVAPILNAHQPSTQRAEIHFLKQGCGEDKICQSN 

LQLVHARFCTRVSDTEFQPLPMDVDGTTALFALSGQPVIGLELMVTNLPSDPAQPQADGDDAHEA 

QLLVMLPDSLHYSGVRALDPAEKPLCLSNENASHVECELGNPMKRGAQVTFYLILSTSGISIETT 

ELEVELLLATISEQELHPVSARARVFIELPLSIAGMAIPQQLFFSGWRGERAMQSERDVGSKVK 

YEVTVSNQGQSLRTLGSAFLNIMWPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDV 

DSRDRRRRELEPPEQQEPGERQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAA 

VLHWGRLWNSTFLEE YSAVKSLE VI VRANITVKS S I KNLMLRDAS WI PVlVr\^ 

PWWILLAVLAGLLVLALLVLLLWKMGFFKRAKHPEATVPQYHAVKIPREDRQQFKEEK^ 

NNWGS PRREGPDAHP I liAADGHPELGPDGHPGPGTA 



Important features s 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 1040-1062 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 



FIGURE 178 



CX3CGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGAAAG 

CAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCTCACAAC 

AAGATCCTO^GGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 

AGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTCTGGATGATAAAC 

AATGGCTGACCT^CAATCTCTCAGTATGACT^GGAAGTCGGAC^ 

GTAGAGGATGATTATTTCCGCTVCTTGGAGTCCAGGAAAACCCTTCGATCAGGCTTTAGATCCTVGC 

TAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTATGCATTGCTCAAGATTCTCAG 

CTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATGAAAGAAGCAGGAGTA 

CAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTGCCCAGTGGTCTATCCCAGCCCTGTTTG 

TGGTTCAGATGGTCATACCTACTCTTTTCAGTGCAAACTAGAATATCAGGCATGTGTCTTAGGAA 

AACAGATCTCAGTCAAATGTGAAGGACATTGCCCATGTCCTTCAGATAAGCCCACCAGTACAAGC 

AGAAATGTTAAGAGAGCATGCA.GTGACCTGGAGTTCAGGGAA.GTGGC?U^CAGATT 

GTTCAAGGCCCTTCATGAAAGTGGAAGTCAAAACAAGAAGACAAAAACATTGCTGAGGCCTGAGA 

GAAGCAGATTCGATACCAGCATCTTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGA 

CTTGATACAAACTATGACCTGCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAA 

TGAACA.GTGTACCAAGGCATTCTTCAATtCTTGTGACACATACAAGGACA 

ATGAGTGGTGCTACTGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATT 

C?^GAAGCGGCAAGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATG^ 

CTACAAGCCAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATG 

AAGTCATGGGATCCAGAATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAG^ 

TTTGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATGA 

TGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGACCATG 

ATGTATACAT TTGAT TGATGACAGTTGAAATCT^TAAATTCTACATTTCTAATATTTACAAAAAT 

GATAGCCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCACATATATOT 

GTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAATAAGAATCATTTGC 

TTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGaVGTCTAGTCAGAC^^ 

TTTTGAAGTGCTACTATAATAAATTTTTCTVCGAGAACAAACTTTGTAAATCTTCCATAAGCAA^ 

TGACAGCTAGtGCTTGGGATCGTACATGTTAATTTTTTGAAAGATAATTCTAAGTGAAATTTi^ 

ATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGGAAAAATATGCATGCTTTAATTGCT^ 

TTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAGGATAACAGAGAGATACCACATGACTCCA 

AAAAAAAAAAAAAA 




FIGURE 17 9 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49829 
xsubunit 1 of 1, 436 aa, 1 stop 
xMW: 49429, pi: 4,80, NX(S/T): 0 

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQ^^ 

EDDYFRTWS PGKPFDQALDPAKDPCLKMKCSRHKVC I AQDSQTAVC I SHRRLTHRMKEAGVDHRQ 

WRGPILSTCKQCPVVYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCPSDKPTSTSR 

NVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTS I LP ICKDSLGWMFNRL 

DTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQRQQDPPCQTELSNIQ 

KRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGSRINGVADCAIDFEISGDF 

ASGDFHEWTDDEDDEDDIMNDEDEIEDDDEDEGDDDDGGDDHDVYI 

Xmportant features : 
Signal peptide: 

amino acids 1-16 » 

Leucine zipper pattern* 

amino acids 246-267 

N-myristoylation sites . 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids .353-365 and 339-352 




FIGURE 180 

CAGACTCCAGATTTCCCTGTCAACCACGAGGAGTCCAGAGAGGAAACGCGGAGCGGAGACAACAGTACCTGACGCCTC 

TTTCAGCCCGGGATCGCCCCAGCAGG GATG GGCGACAAGATCTGGCTGCCCTTCCCCGTGCTCCTTCTGGCCGCTCTG 

CCTCCGGTGCTGCTGCCTGGGGCGGCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTTCCCGCCGGC 

CAGAAGGAGTGCTTCTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTAGATGGAGCAGGA 

TTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGAAAATCAGATGGAGTTCAC 

ACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGACAATACATTCAGCACCATTTCTGAGAAGGTGATTTTC 

TTTGAATTAATCCTGGATAATATGGGAGAACAGGCACAAGAACAAGAAGATTGGAAGAAATATATTACTGGCACAGAT 

ATATTGGATATGAAACTGG7VAGACATCCTGGAATCCATCAACAGCATCAAGTCCAGACTAAGCAAAAGTGOT 

CAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAACATACAAGAAAGCAACTTTGAT^ 

ATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCy^GTTTATATGCTGAAGAGTCTGTTTGT^GATAAGAGG 

AAAAGTAGAAC TTAAA ACTCCAAACTAGAGTACGTj^CATTGAAAAATGAGGCATAAAAATGCAA 

TCAAGACCATTAATGGTCTTCTCCAAAATATTTTGAGATATAAAAGTAGGAAACAGGTATAATTTTAATGT 

AAGTCTTCACTTTCTGTGCAAGTAATCCTGCtGATCaVGTTGTACTTAAGTGTGTAACAGGAATATTTTGCA 

AGGTTTAACTGAATGAAGCCATATTAATAACTGCATTTTCCTAACTTTGAAAAATTTTGCAAATC 

AAATAAATGAGTATTGGGCCTAATTGCAACACCAGTCTGTTTTTAACAGGTTCTATTACCCAGAACTTTTTTG 

GCGGCAGTTACAAATTAACTGTGGAAGTTTTCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGATGGATTGA 

ATAAATCTTTAGACTACAAAAGCCCAACTTTTCTCTATTTACATATGCATCTCTCCTATAATGTAAATAGAATAATAG 

CTTTGAAATACAATTAGGTTTTTGAGATTTTTATAACCAAATACATTTCAGTGTAACATATTAGCAGAAAGCATTAGT 

CTTTGTACTTTGCTTACATTCCCAAAAGCTGACATTTTCACGATTCTTAAAAACACAAAGTTACACTTACTAAAATTA 

GGACATGTTTTCTCTTTGAAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTCTCTAACCCTT 

AACTAAAGTGTAGGATTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGTATCTGTCAAGTTAATATCT 

GTCAACAGTTAATAATCATGTTATGTTAATTTTAACATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATG 

AAGGAAATATTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAA 

CAGGAAAGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTATAAATAAATATCT 

AGAATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGGAGATGATTAATTTCCAGTTAGCTGGAAG 

AAACTTTGGCTGTAGGTTTTTATTTTCTACAAGAATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTTATAAAA 

TGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTATTAAAAT 

GGCCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTTTTAAACACCTAAAT 

GTGAATAACCCATATATACAACAAAGTTTCTGCCATCTAGCTTTTTGAAGTCTATGGGGGTCTTACTCAAGTACTAGT 

AATTTAACTTCATCATGAATGAACTATAATTTTTAAGTTATGCCCATTTATAACGTTGTTTATGACTACATTGTGAGT 

TAGAAACAAACTTAAAATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATTCTTGATGAGCAATA^ 

TAACCAGAGAGTGATTTCATTTACACTCATAGTAGTATAAAAAGAGATACyVTTTCCCTCTTAGGCCCCTGGGAGAAGA 

GCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAATGCCGTATATGATCAATTACCTTAATTG^ 

AGAAAATGCTTCAGGTGTCTAGGGGTATCCTCTGCAACACTTGCAGAACAAAGGTCAATAAGATCCTTGCCTATGAAT 

ACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCAAATTTACAGTACCATAACTAATAAAGCAGGGTA^ 

ATAAACTACTGCATCTTTTCTATAAAACTGTGATTAAGAATTCTACCTCTCCTGTATGGCTGTTACTGTACTGTACTC 

TCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCTACATGTATGATTTGTGCCACTGATCTTAAACCTATG 

ATTCAGTAACTTCTTACCATATAAAAACGATAATTGCTTTATTTGGAAAAGAATTTAGGAATACTAAGGAC^ 

TTTATAGACAAAGTAAAAAGACAGATATTTAAGAGGCATAACCAAAAAAGCAAAACTTGTAAACAGAGTAAAAATCTT 

ATGCTAAATTGAGTAAGCTGTTTATCACTTAACAGCTCATTTTGTCTTTTTCAATATACAAATTTTAAAAATACTACA 
ATATTTAACTAAGGCCCAACCGATTTCCATAATGTAGCAGTTACCGTGTTCACCTCACACTAAGGCCTAGAGTTTGCT 
CTGATATGCATTTGGATGATTAATGTTATGCTGTTCTTTCATGTGAATGTCAAGACATGGAGGGTGTTTGTAATTTTA 
TGGTAAAATTAATCCTTCTTACACATAATGGTGTCTTAAAATTGACAAAAAATGAGCACTTACAATTGTATGTCTCCT 
CAAATGAAGATTCTTTATGTGAAATTTTAAAAGACATTGATTCCGCATGTAAGGATTTTTCATCTGAAGTACAATAAT 
GCACAATCAGTGTTGCTCAAACTGCTTTATACTTATAAACAGCCATGTTAAATAAGCAACGTATTGTGAGTACTGATA 
TGTATATAATAAAAATTATCAAAGGAAAA 



FIGURE 181 

></usr/seqdb2/sst/DNA/Dnaseqs -min/ss .DNA52196 
xsubunit 1 of 1, 229 aa, 1 stop 
xMW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLA/^PPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEYQVL 
DGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFELILDNMG 
EQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRNIQESNFDRV 
NFWSMVNLVVMVWSAIQVYMLKSLFEDKRKSRT 

Important features : 
Signal peptide: 

amino acids 1-23 

Transmembrane domain ; 
amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 



Tyrosine kinase phosphorylation site. 

amino acids 55-62 




FIGURE 182 

CCATCCCTGAGATCTTTTTATAAAAAACCCTVGTCTTTGCTGACCAGACAAAGC^ 

ACCAGAGAGTCGCAGACAC TATG CTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATGCTGCTT 

TCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCT^GAAGGAACTGCCCTCT 

GATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGTTTTTGTCACCA^ 

AATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAT^AACTGGTGTCTGTGCTC 

AGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAGTAACAGCTACTCATACAT 

CTGGATTGGGCTCCTVTGACCCCACACAGGGCTCTGAGCCTGATGGAGATGGATGC^ 

GCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCCTCCACCy^TCT^ 

TGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTGGAAAGATTATAAC^ 

ACCCTATGTCTGCAAGTTCAAGGACTAGGGCAGGTGG^ 

GCTCATCATGGACATGAGACCAGTGTGAAGACTCACCCTGGAAGAGAATATTCTCCCCAAACTGC 
CCTACCTGACTACCTTGTCATGATCCTCCTTCTTTTTCCTTTTTCTTCACCTTCATTTCT^GGCTT 
TTCTCTGTCTTCCATGTCTTGAGATCTCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAA 
AAAAAAAAAAAAAA 



FIGURE 183 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56965 
<subunit 1 of 1, 175 aa, 1 stop 
<MW: 19330, pi: 7.25, NX(S/T): 1 

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWMDAD 
LACQKRPSGKLVSVLSGAEGSFVSSLWSISNSYSYIWIGLHDPTQGSEPDGDGWEWSSTDVNn^ 
FAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 

amino acids 1-26 



C*type lectin domain signature. 

amino acids 146-171 




FIGURE 184 

CCAGTCTGTCGCCyVCCTCT^CTTGGTGTCTGCTGTCCCCGCaVGGCTUVGCCTGG^ 

GAGGAGTGGGCCGGGAC CATG CGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGCTGCCTG 

CGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGTCGGACTGTG 

TCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTACTCCCG^ 

GTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCy^GCAAGTGTAAGCCCTCGGA 

TGTGGATGGCATCGGCOVGACCCTGCCCGTGTCCTGCTGCAATACTGAGCTGTGC^ 

GGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTCCTCCCACTCTTGAGCCTCCGA 

CT GTAGA GTCCCCGCCCACCCCCT^TGGCCCTATGCGGCCCAGCCCCGAATGCCTTGAAGAAGTGC 

CCCCTGCACCAGGAAAAAAAAAAAAAAAAA 



FIGURE 185 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56405 
<subunit 1 of 1/ 125 aa, 1 stop 
<MW: 13115, pi: 5.90, NX(S/T): 1 
MRGTRLALIJU^VLAACGEIAPALRC WCPE PTGVSDC^ 

DSTVTKSCASKCKPSDVDGIGQTLPVSCCNTELCNVDGAPALNSLHCGALTLLPLLSLRL 

Important features s 
Signal peptide: 

amino acids 1-17 

N-glycosylation site. 

amino acids 46-4 9 




FIGURE 186 

CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCACG 

GTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTTGAGTC 

TTCTGA GATGA TGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAGCGGCGG 

CTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATTCCAACG 

ATCAAGAACCTGCCCCO^CCGCTGGGCGGCGCTGCGGGGCACCCA 

GCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACAACTACCAGCCGT^ 

GCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACCCGCGGAGGGGACGCA 

GGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATGCGTCACGCTATGTGCTG 

CCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAAATCATTTCCGAGGAGAAA 

TTGAGGAAACCT^TCACTGAAAGCTTTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAA 

ACCACCTTGTCTTCATyyUVTGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTC^ 

AGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGA 

AAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGT 

TGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTC 

TAGGCTTCACACTTGTCAGAGACAC TAA ACCT^GCTATCCAAATGCAGTGAACTCCTTTTATATA^ 

TAGATGCTATGAAAACCTTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGG 

TTTCAGTTAAGCATTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATG 

GAACTCCCCTGTGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAA 

TGCAATGAAACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTA 

AATTTTTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCA 

TTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAA 

GGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATTTTCT 

GAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGAAATAAAA 

TTTAACATTTAAAAAAAAAAAAA 




FIGURE 187 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57530 
<s\abunit 1 of 1, 266 aa, 1 stop 
<MW: 28672, pi: 8.85, NX(S/T): 1 
MMALGAAGATRVFVA^WAAALGGHPLLGVSATLNSVIJ!^SNA 

I L YPGGNKYQTI DNYQP YPCAEDEECGTDE YCAS PTRGGDAGVQ I CLACRKRRKRCMRHAMCCPG 
NYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEGSVCLRSSDC 
ASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQKDHHQASNSSRL 
HTCQRH 

Important features : 
Signal peptide: 

amino acids 1-23 

N-glycosylation site . 

amino acids 256-259 



Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 




FIGURE 188 

TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGAACG 

GGAAGGTTTTGTGGGACCCT^GGTTGAAATGACGGTC^TTTTTTTTTCTTTCTCCTTO^ 

TTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 

GTTTTCGGCGGCa^CCTTNTGCTGGGAGTGAGCGCO^CCTTGAATCGGTTTTC^ 

ATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTTTGCAGTCAGC^^ 

GCCGGGAATCCTGTACCCGGGCGGGAATAAGTAGCAGACCATTGACAATTACCAGCCGTACCCGT 

GCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACCCGCGGAGGGGANGCG 

GGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACGCTGCATGeGTCANGCTATGTGCTG 

CCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTTCTGATCAAAATCATTTCCGAGGAGAAA 

TTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTTGGATGGG 




FIGURE 189 



GAGGAACCtACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATCCCGTGCGCCGCGGCTG 

GGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTT 

GAGGATGGGGGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGAAGTATTAGA 

AATGAGCTGAAGACCATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCTTGAAGTAATGTA 

GACAGAAGTTCTCAAATTTGCATATTACATCAACTGGAACCAGCAGTGAATCTTAATGTTCACTTAAATCAGAACTTG 

CATAAGAAAGAGAATGGGAGTCTGGTTAAATAAAGATGACTATATCAGAGACTTGAAAAGGATCATTCTCTGTTTTCT 

GATAGTGTATATGGCCATTTTAGTGGGCACAGATCAGGATTTTTACAGTTTACTTGGAGTGTCCAAAACTGCAAGCAG 

TAGAGAAATAAGACAAGCTTTCAAGAAATTGGCATTGAAGTTACATCCTGATAAAAACCCGAATAACCCA^ 

TGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGATCTACGGAAAA^ 

AGAAAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAGCTGGAACTATTATCGTTATGATTTTGG 

TGATGATCCTGAAATCATAACT^TTGGAAAGAAGAGAATTTGATGCTGCTGTTAATTCTGGAGAAOT 

TTTTTACTCCCCAGGCTGTTCACACTGCCATGATTTAGCTCCCACATGGAGAGACTTTC 

ACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAATGCTTTGCCGAATGAAAGGAGTCAACAGCTATCC 

CTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATATCATGGAGACAGATCAAAGGAGAGTTTAGTGAG 

GCAGCATGTTAGAAGTACAGTGACAGAACTTTGGACAGGAAATTTTGTCAACTCCATACAAACTGCTTTTGCTGCTGG 

TATTGGCTGGCTGATCACTTTTTGTTCAAAAGGAGGAGATTGTTTGACTTCACAGACACGACTCAGGCTTAGTGGCAT 

GTTGTTTCTCAACTCATTGGATGCTAAAGAAATATATTTGGAAGTAATACATAATCTTCCAGATTTTGAACTACTTTC 

GGCAAACACACTAGAGGATCGTTTGGCTCATCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAA 

TGATCCTGAGCTGAAAAAACTA/IAAACTCTACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGC 

ACCAGACATCTGTAGTAATCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACAAGGAACCAAAGAATATGA 

AATTCATCATGGAAAGAAGATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAATTCTCATGTTACCACGCT 

TGGACCTCAAAATTTTCCTGeCAATGACAAAGAACCATGGCTTGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCG 

AGCTTTACTACCAGAGTTACGAAGAGCATCAAATCTTCTTTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGT 

TCATGAGGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACAGTGGTATTCTVACCAGTCCAACATTCATGA 

GTATGAAGGACATCACTCTGCTGAACAAATCTTGGAGTTCATAGAGGATCTTATGAATCCTTCAGTGGTCTCCCTTAC 

ACCCACCACCTTCAACGAACTAGTTACACAAAGAAAACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGTGGTG 

TCATCCTTGCCAAGTCTTAATGCCAGAATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGGCAGTAT 

AGATTGCCAACAGTATCATTCTTTTTGTGCCCAGGTUUUICGTTCAAAGATACCCTGAGATAAGATTTTTTCCCCCAAA 

ATCAAATAAAGCTTATCAGTATCACAGTTACAATGGTTGGAATAGGGATGCTTATTCCCTGAGAATCTGGGGTCTAGG 

ATTTTTACCTCAAGTATCCACAGATCTAACACCTCAGACTTTCAGTGAAAAAGTTCTACAAGGGAAAAATCATTGGGT 

GATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCTCTTGGCTAGGATGATTAA 

AGGAAAAGTGAAAGCTGGAAAAGTAGACTGTCAGGCTTATGCTCAGACATGCCAGAAAGCTGGGATCAGGGCCTATCC 

AACTGTTAAGTTTTATTTCTACGAAAGAGCAAAGAGAAATTTTCAAGAAGAGCAGATAAATACCAGAC^ 

AATCGCTGCCTTAATAAGTGAAAAATTGGAAACTCTCCGAAATCAAGGCAAGAGGAATAAGGATGAACTT^ 

TTGAAGATGAAGAAAAAGTTTAAAAGAAATTCTGACAGATGACATCAGAAGACTVCCTATTTAC^^ 

ATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCAGJ\ATTATCTACAGCACTGGTGTAAAAGAAGGGT 

CTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTAGACTTTGCAGGCTATAATATATGGTTCT^CACATC 

AACAAGTW^TAGAGTCATCATGTATTCTTTGTTATTTGCTTTTAAC^CCTTTAAAAAATATTAAAACGATTCTTAGCT 

CAGAGCCATACAAAAGTAGGCTGGATTCAGTCCATGGACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTC 

AGGTGGCTGGCTTGAACATGAGTCTGCTGTGCTATCTACATAAATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCAC 

GTTTTTTGGCTGACCTGAAAAGAGGTAACTTAGTTTTTGGTCACTTGTTCTCCTAAAAATGCTATCCCTAACCATATA 

TTTATATTTCGTTTTAAAAACACCCATGATGTGGCACAGTAAACAAACCCTGTTATGCTGTATTATTATGAGGAGATT 

CTTCATTGTTTTCTTTCCTTCTCA 

AAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAGCAGTATATGTGCACACAGTAAGTACAC 

AAATTTGAGCAACAGTAAGTGCACAAATTCTGTAGTTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAAATTA 

TAGCAATTAACTGGGCATTGTAGAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATATGT 

GTTCATGTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATATTTACAT 

ATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTGGTTTTTCACT 

CCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAGCTTATAATGATACTGTA 

GTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATAACTGAAGTTATTTTT 

ATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGACTCAAAGAATCACAAATT 

TGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGTAAAAATTCCAATCAGTCAAAAGAGGTCA 

ATGAATTAAAAGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAA 



FIGURE 190 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56439 
.<subunit 1 of 1, 747 aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKLHPD 
KNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGIYDDDPE 
I ITLERREFDAAVNSGELWFVNFYS PGCSHCHDLAPTWRDFAKEVDGLLRIGAVNCGDDRMLCRM 
KGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFVNSIQTAFAAGIGWL 
ITFCSKGGDCLTSQTRLRIiSGMLFLNSLDAKEIYLEVIHNLPDFELLSANTLEDRLAHHRWLLFF. 
HFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQPSLAVFKGQGTKEYEIHHG 
KKILYDIIJ^FAKESWSHVTTLGPQNFPANDKEPWLVDFFAPWCPPCRALLPELRRASNLLYGQL 
KFGTLDCTVHEGLCNMYNI QAYPTTWFNQSNIHE YEGHHSAEQ I LEF I EDLMNPS WSLTPTTF 
NELVTQRKIINEVWMVDF YS PWCHPCQVLMPEWKRMARTLTGL I W 

PEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLGFLPQVSTDLTPQTFSEKVLQGKNHWVIDFY 
APWCGPCQNFAPEFELLARMIKGKVKAGKVDCQAYAQTCQKAGIRAYPTVKFYFYERAKRNFQEE 
QI NTRDAKAI AAL I SEKLETLRNQGKRNKDEL 

Important features : 

Endoplasmic reticulum targeting sequence. 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487 



FIGURE 191 

AGACAGTACCTCCTCCCTAGGACTACACAAGGACTGiUVCCAGAAGGAAGAGGACAGAGC^^ 

ATGA ACATCT^TCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGAGTCGTT 

GGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTOSlTT^^ 

ctgggcatggaataggcaggcagactacttatgaatttgcaaaacgacagagca 

tgggatattaataagcgcggtgtggaggaaactgcagctgagtgccgaaaactaggcgtcactc 

gcatgcgtatgtggtagactgct^gcaacagagaaqagatctatcgctctc^ 

aagaagtgggtgatgtaacaatcgtggtgaataatgctgggacagtatatcc^ 

agcaccaaggatgaagagattaccaagacatttgaggtcaacatcctaggacatttttggatcac 

aaaagcacttcttccatcgatgatggagagaaatcatggccacatcgtcacagtggcttcagtgt 

gcggccacgaagggattccttacctcatcccatattgttccagcaaatttgccgctgttggcttt 

cacagaggtctgacatcagaacttcaggccttgggaaaaactggtatcaaaacctcatgtctctg 

cccagtttttgtgaatactgggttcaccaaaaatccaagcacaagattatggcctgtattggaga 

cagatgaagtcgtaagaagtctgatagatggaatacttaccaataagaaaatgatttttgttcca 

tcgtatatcaatatctttctgagactacagaagtttcttcctgaacgcgcctcagcgattttaaa 

tcgtatgcagaatattcaatttgaagcagtggttggccacaaaatcaaaatgaaatgaataaata 

agctccagccagagatgtatgcatgataatgatatgaatagtttcgaatcaatgctgcaaagctt 

GAACAAGATTAATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGT 

TTTTCCTTTCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTfTTCTTTAA 

GATATTTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAA' 

CTTATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACA^ 

ATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTTAT^ 

CAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTTTCATTAG 

CCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTATAACTCTGAAG 

TCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGATACATTATGAAAG 

GTC^CCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGATAACCTTTTTCTTTGT 

AATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAATTTTAGGCTCAAAA^ 

GCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAACAATGG^^^ 




FIGURE 192 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56409 
<siibunit 1 of 1, 300 aa, 1 stop 
<MW: 33655, pi: 9.31, NX(S/T): 1 

MNI ILEILLLLITI lYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQSILVL 
WDINKRGVEETAAECRKLGVTAHAYVVDCSNREEIYRSLNQVKKEVGDVTIV^^ 
STKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYCSSKFAAVGF 
HRGLTSELQALGKTGIKTSCLCPVFVOTGFTKNPSTRLWPVLETDEVVRSLIDGILTNKKM 
S Y INI FLRLQKFLPERAS AI LNRMQN I QFEAWGHKI KMK 

Important features : 
Signal peptide: 

amino acids 1-19 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 30-33 and 58-61 

Short- chain alcohol dehydrogenase family protein 

amino acids 165-202, 37-49, 112-122 and 210-219 . 




FIGURE 193 

CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGGATG 

CCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGCAG G 

ACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCTGCTGATC^ 

CGTGTACTGGGACAGCGC?VGGCGCCGCGCACTTCTACTTGCACACGTCCTTCTCTAG^ 

CGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACGGCCGACTCCGATGTC 

GACGAGTTTCTGGAOU^GTTTCTCAGTGCTGGCGTGAAGCAGAGCGACCTTCCCAGAAAGGAGA 

GGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAGGCTACGACTGGTCCCCGCGCG 

ACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAGCGGAGGAGCGTGCTGCGGGGCTTC 

TGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCGCGCATTCGACGACATCCCCAACTCGGA 

GCTGAGCCACCTGATCGTGGACGACCGGCACGGGGCCATCTACTGCTACGTGCCCAAGGTGGCCT 

GCACCAACTGGAAGCGCGTGATGATCGTGCTGAGCGGAAGCCTGCTGCACCGCGGTGCGCCCTAC 

CGCGACCCGCTGCGCATGCCGCGCGAGCACGTGCACAACGCCAGCGCGCACCTGACCTTCAACAA 

GTTCTGGCGCCGCTACGGGAAGCTCTCCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCA 

AGTTCCTCTTCGTGCGCGACCCCTTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTG 

GAGAACGAGGAGTTCTACCGCAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAG 

CCTGCCCGCCTCGGCGCGCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCC 

AGTACCTGCTGGACCCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTAC 

CGCCTCTGCCACCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGA 

CGCCGCGCAGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGA 

ACAGGACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAG 

CTGTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCCG 

agactgaaagctttcgcgttgctttttctcgcgtgcctggaacctgacgcacgcgcactccagtt' 

TTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCCATTGAGT 
ACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA 




FIGURE 194 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA56112 
csubunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(S/T): 4 

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTADSD 

VDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAERRSVLRG 

FCANSSIAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIV^ 

YRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPFVRLISAFRSKFE 

LENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDPHTEKLAPFNEHWRQV 

YRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRTASSWEEDWFAKIPLAWRQ 

QLYKLYEADFVLFGYPKPENLLRD 

Important features : 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cys teine-rich region protein 

amino acids 32 9-332 




FIGURE 195 

TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAOT 

AGGCCCAGAGTAGAGAGAGAGAGAGACCGACGTACACGG GATGG CTACGGGAACGCGCTATGCCG 

GGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGCGCCTTCGTC 

AACAGCGGGGCCCGAGTGGTTATCTGCGACAAGGATGAGTCTGGGGGCCGGGCCCTGGAGCAGGA 

GCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATGTGAAGACCCTGGTTT 

CTGAGACCATCCGCCGATITGGCCGCCTGGATTGTGTTGTCAACAACGCTGGCCTICCACCC^ 

CCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCTGCTGGAGCTGAACCTACTGGG 

GACGTACACCTTGACO^^GCTCGCCCTCCCCTACCTGCGGAAGAGTCAAGGGAATGTCATCAACA 

TCTCCAGCCTGGTGGGGGCAATCGGC(:A.GGCCCAGGCAGTTCCCTATGTGGCCACCi^ 

GTAACAGCCATGACCAAAGCTTTGGCCCTGGATGAAAGTCCATATGGTGTCCGAGTCAACTGTAT 

CTCCCCAGGAAACATCTGGACCCCGCTGTGGGAGGAGCTGGCAGCCTTAATGCCAGACCCTAGGG 

CCACAATCCGAGAGGGCATGCTGGCCCAGCCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGG 

GCTGCGGCAGTGTTCCTGGCCTCCGAAGCCAACTTCTGCACGGGCATTGAACTGCTCGTGACGGG 

GGGTGCAGAGCTGGGGTACGGGTGCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCC 

CTTCC TGA TTTCTCTCATTTCTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAA 

CCTGTATCAGATGCAGCCCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACC 

CTGCAGGTTCCCATAAA?^CGATTTGCAGCC 




FIGURE 196 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56045 
<siabunit 1 of 1, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NX(S/T): 1* 

MATGTRYAGKVVVVTGGGRGIGAGI VRAFVNSGARWI CDKDESGGRALEQELPGAVF I LCDVTQ 

EDDVKTLVSETIRRFGRLDC^A^NNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTK^^ 

KSQGl^INISSLVGAIGQAQAVPYVATKGAVTAMTKALALDESPYGWVNCISPGNIWT 

AALMPDPRATIREGMI^QPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGGAELGYGCKASRS 

TPVDAPDIPS 

Important features : 
N-glycosylation site. 

amino acids 138-141 

Short- chain alcohol dehydrogenase family protein 
amino acids 10-22, 81-91, 134-171 and 176-185 




FIGURE 197 

AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAATGGACTG^ 

TTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAG 

GGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACCTGGTGTCA^ 

GGATGAAACCGTATGCCCGGATGGAGGAGTATGAGAGGAACT^TCGAGGAGATGGTGGCCCAGCT 

AGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCTGTGGATGTCCAACAA 

GAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCCGTATCCCCGTGGACCTGC 

CGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACCATGCI?^GGAGGACCG^ 

GTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCGCCTCTGCCCGCCACCGCCCCGCAC 

AGGGCCTTGCCGCCAGCGCGC7VGTCATGGAGACCATCGCTGTGGGCTGCACCTGCATCTT CTG 

TCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGACCATCCTCCTTGCACCTTTGTGCCAAGAAA 

GGCCTATGAAAAGTAAACACTGACTTTTGAAAGCAAG 



FIGURE 198 

</usr/seqdb2/sst:/DNA/Dnaseqs .min/ss .DNA59294 
<subunit 1 of 1, 180 aa, 1 stop 
<MW: 20437, pi: 9.58> NX(S/T): 1 

MDWPHNLLFLLTISIPLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEYERN 
I EEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLS PWGYS INHDPSRI PVDLPEARCLCLGCVNPF 
TMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCiF 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 



Homologous region to XL- 17 

amino acids 96-180. 



FIGURE 199 



GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGGCGA 

GCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCGGCGCCC 

AACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGATCGCGGCTGT 

GGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGGTCCAGCCCATGA 

CCGCCTCCAACTGGACGeTGGTGATGGAGGGCGAGTGGATGCTGAAATTTTACGCCCCATGGTGT 

CCATCCTGCCAGCTVGACTGATTCAGAATGGGAGGCtTTTGCAAAGAATGGTGA^ 

CAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTGGCCGCTTCTTTGTCACCACTC 

GACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATCAGTCGAGCCTCTGACTGGCTGGAAATC 

CCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTCTTTTTAGCATCTCTGGCAAGATATGGCATC 

TTCACAACTATTTCACAGTGACTCTTGGAATTCCTGCTTGGTGTTCTTATGTGTTTTTCGTCATA 

GCCACCTTGGTTTTTGGCCTTTTTATGGGTCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGT 

GCCACTTCCAAGGCATTTATCTGAGCGTTCTGAGCAGAATCGGAGATCAGAGGAGGCTC^ 

CTGAACAGTTGCAGGATGCGGAGGAGGAAAAAGATGATTCAAATGAAGAAGAAAACAAAGACAGC 

CTTGTAGATGATGAAGAAGAGAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGA 

GGAGGACAACTTGGCTGCTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCC^ 

GAGAGGACGGTGTGACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAA 

CCCTGCCCAGCTGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGA 

CAAGGGACTGTAGATTTAATGATGCGTTTTCAAGAATACACACCiy^ 

TTTGGCCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATG. 

CTCTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGTG 

ACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAACAAG 

TTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATCAGCACCT 

TCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCTC^ 

CCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTATTTTTGTOVAATTG 

CA.GGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAAATTGAAAGGGCCTTGG 

GTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTGTGCTATGTTTTATTTCTTA 

CCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCTCCACACTCTTCACTATTATCTC 

TTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAACTGTGTTTGTTCATTCTGACCTAAGG 

GGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGTGACTGCCAAACATCTCAAATGAAATGTT 

GTGGCCATCAGAGACTCAAAAGGAAGTAAGGATTTTACAAGACAGATTAAAAAAAAATTGTTTTG 

TCCAAAATATAGTTGTTGTTGATTTTTTTTTAAGTTTTCTAAGCAATATTTTTCAAGCGAGAAGT 

CCTCTAAGTCTTGCCAGTACAAGGTAGTCTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATC 

TCAAGGGGTTCCCTGGGTCTTGAACTACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTT 

CAGTGATGTGCTTTTGGTGAAAGAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTT 

GTTTCCATCTTCTGTAATCTTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACT 

GTAAGTACCCAGGGAGGCTAATTTCTTT 



FIGURE 200 



</usr/seqdb2/sst/DNA/Dnaseqs •min/ss .DNA56433 
<subunit 1 of 1, 349 aa, 1 stop 
<MW: 38952, pi: 4.34, NX(S/T): 1 



MAGGRCGPQLTALIiAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAPWCP 

SCQQTDSEWEAFAKNGE ILQI SVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGI FRRYRGPGI FED 

LQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTWLGIPAWCSYVFF^^ 

TLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEKDDSNEEENKDSL. 

VDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTREEVEPEEAEEGISEQP 

CPADTEWEDSLRQRKSQHADKGL 

Important features : 
Signal peptide: 

amino acids 1-22 

Transmembrane domain : 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 



Flavodoxin proteins 

amino acids 173-187 



FIGURE 201 

ATCTGGTTGAACTACTTAAGCTTAATTTGTTAAACTCCGGTAAGTACCTAGCCCACATGATTTGA 
CTCAGAGATTCTCTTTTGTCCACAGACAGTCATCTCAGGGGCAGAAAGAAAAGAGCTCCCAAATG 
CTATATCTATTCAGGGGCTCTCAAGAACAATGGAATATCATCCTGATTTAGAAAATTTGGATGAA 
GATGGATATACTCAATTACACTTCGACTCTCAAAGCAATACCAGGATAGCTGTTGTTTCAGAGAA 
AGGATCGTGTGCTGCATCTCCTCCTTGGGGCCTCATTGCTGTAATTTTGGGAATCCTATGCTTGG 
TAATACTGGTGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCCAGCCCTTGTCCTCCTAAT 
TGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACTAAATTCCTGGGATGGAAGTAA 
AAGACAATGCTGGCAACTGGGCTCTAATCTCCTAAAGATAGACAGCTCAAATGAATTGGGATTTA 
TAGTAAAACAAGTGTCTTCCCAACCTGATAATTCATTTTGGATAGGCCTTTCTCGGCCCCAGACT 
GAGGTACCATGGCTCTGGGAGGATGGATCAACATTCTCTTCTAACTTATTTCAGATCAGAACCAC 
AGCTACCCAAGAAAACCCATCTCCAAATTGTGTATGGATTCACGTGTCAGTCATTTATGACCAAC 
TGTGTAGTGTGCCCTCATATAGTATTTGTGAGAAGAAGTTTTCAAT GTAAG AGGAAGGGTGGAGA 
AGGAGAGAGAAATATGTGAGGTAGTAAGGAGGACAGAAAACAGAACAGAAAAGAGTAACAGCTGA 
GGTCAAGATAAATGCAGAAAATGTTTAGAGAGCTTGGCCAACTGTAATCTTAACCAAGAAATTGA 
AGGGAGAGGCTGTGATTTCTGTATTTGTCGACCTACAGGTAGGCTAGTATTATTTTTCTAGTTAG 
TAGATCCCTAGACATGGAATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTT 
TTGAGATAGGGTCTCACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAG 
CTATCTCTCGCCTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAAT 
TTTTGGTGTTTTTTGTAGAGACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGC 
TTAAGTGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCACACCTG 
GCCCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGAA 
TCTTAATTTCTGGCTCTATCAGAGTTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGTGTGT 
TGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTATGAACTGAGGGTGAAATATATTTC 
TGAATAGCTAAATGAAGAAATGGGAAAAAATCTTCACCACAGTCAGAGCAATTTTATTATTTTCA 
TCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTACTTTTTCTTTATC 
AATTAAATAGCTCAGAGAGTACATCTGCCATATCTCTAATAGAATCTTTTTTTTTTTTTTTTTTT 
TTTGAGACAGAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAACGGCACGATCTCGGCTCACCGC 
AACCTCCGCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGT 
CAGGCAGCACCACACCCGGCTAATTTTGTATTTTTTTAGTAGAGACAGGGTTTCTCCATGTCGGT 
CAGGGTAGTCCCGAACTCCTGACCTCAAGTGATCTGCCTGCCTCGGCCTCCCAAGTGCTGGGATT 
ACAGGCGTGAGCCACTGCACCCAGCCTAGAATGTTGTATAATATGTAATTGTAGGGAAACTGCTC 
TCATAGGAAAGTTTTCTGCTTTTTAAATACAAAAATACATAAAAATACATAAAATCTGATGATGA 
ATATAAAAAAGTAACCAACCTCATTGGAACAAGTATTAACATTTTGGAATATGTTTTATTAGTTT 
TGTGATGTACTGTTTTACAATTTTTACCATTTTTTTCAGTAATTACTGTAAAATGGTATTATTGG 
AATGAAACTATATTTCCTCATGTGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTA 
TTTTTATTTCCAATGGATATTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTA 
ATATGTGAAAAGAAATTGTACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTT 
ATGTGGATTTCATTTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAA 



FIGURE 202 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss •DNA53912 
<subunit 1 of 1, 201 aa, 1 stop 
<MW: 22563, pi: 4.87, NX (S/T) : 1 

ME YHPDLENLDEDGYTQLHFDSQSNTR I AWS EKGS CAAS PPWRL I AVI LGI LCLVI LVI AWLG 
TMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQVSSQPD 
NSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQLCSVPSYSIC 
EKKFSM 

Important features : 

Type II transmembrane domain: 

amino acids 45-65 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 



Homologous region to LDL receptor 

amino acids 34-67 and 70-200. 



FIGURE 203 



GGAAGGGGAGGAGCAGGCCACACAGGCACAGGCCGGTGAGGGACCTGCCCAGACCTGGAGGGTCTCGCTCTGTCACAC 

AGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCC 

TCCCGAGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAA ATGA CTCCCCAGTCGCTGCTG 

CAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTCGC 

TTCTGCAGCCAGCGGAACCAGACACAGAGGAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTCCATCGAG 

AACTCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCqGATCCTTCCCTGACCCCAGG 

GGCCTCTACCACTTCTGCCTCTACTGGAACCXSACATGCTGGGAGATTACATCTTCTCTATGGCAAGCGTGACTTCTTG 

CTGAGTGACyUUVGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCT 

ACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCCTTCCAC^ 

CCTCCCCy^CACGGCCGCTCACAATGCCTCGGTGGACATGTGCGAGCTCAAAAGGGACCTCCy^GCTGCTC^ 

CTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAfl^ 

CTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTGl^ 

CCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTG 

CTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTC 

AGCAGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAAC 

ACCAAAGTAGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAA 

TGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGA 

GAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCC 

GTGCACAAGCACTACCTGAGCCTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCC 

GCCTACCTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACCTG 

CTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGC 

CGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTAC 

CGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCC 

ATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCA 

GAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGC 

CTGGTGTTTCTGTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAG 

TGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCCTTTGCT 

TCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCCTCATCTTCATCTGG 

TACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGGCTCCCCATC 

AGCTCGGGCAGCACCTCGTCCAGCCGCAT CTAGG CCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATC 

GTCGCACACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACTTTGGAAAGCCCAACG^ 

ATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCG^ 

TCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCT^TGCTGCCTAGGGTACTGTCCCCACATCTGTCCC^ 

TGGAGGCCTGGTCTCrCCTTACAACCCCTGGGCCCAGCCCTCATTGCTGGGGGCCAGGCCTTGGATCrTGAGG^ 

GCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAC^ 

GCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCAGACCTTCT^GGGCCA 

AGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGCTCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCC 

CTCAGCCCCCCAGTCCTCCCTCGATCTTCCCTGGGGTTCTCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAG 

CTGGGGGTCCCCGATTCCAATGCTGTTTTTTGGGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGT 

CTACTGCACAAGCCTCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCTCTGTCCATC 

TGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACCAAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTG 

AAGCCCTCTTGTGGCAAGAACTGTGGACCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTG 

ACCTCCTCTGGTGACACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAGGCG 

CCCTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGCTGGCCTGTTGTAGGTGGTAGGG 

ACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATTCAGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCT 

GGAGCTACAGGGACAGGGAGCCATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTT 

GACCTTGAAGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAAAGAAGTACATGTTCA 

ttgtagagaatttggaaactgtagaagagaatcaagaagaaaaataaaaatcagctgttgtaatcgcctagcaaaaaa 
at^aaaaatvaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa/^aaaaaaaaaaaaaaaaa;^^ 



FIGURE 204 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA50921 
<subunit 1 of 1, 693 aa, 1 stop 
<MW: pi: 8.87, NX(S/T.): 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSEEAL 

TVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQHQEESIiA 

QGPPLLATSVTSWWSPQNISLPST^SFTFSFHSPPHTAAHNASVDMCELKRDLQLLSQFLKHPQK 

ASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATWKLQPTAGLQDLHIHSRQEEEQS 

EIMEYSVLLPRTLFQRTKGRSGEAEra^LLLVDFSSQALFQDKNSSQVLGEKVLGIWQNTKVANL 

TEPVVLTFQHQLQPKNWLQCVFWEDPTLSSPGHWSSAGCETVRRETQTSCFCNHLTYFAVLl^ 

SSVEVDAVHKHYLSLLSYVGCWSALACLVTIAAYLCSRVPLPCRRKPRDYTIKVHMNLLLAVFL 

LDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLEGYNLYRLVVEVFGTYVPGYLLKLS 

AMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIYPSMCWIRDSLVSYITNLGLFSLVFLFN 

MAMI^TNTVVQILRLRPHTQKWSHVLTLLGLSLVLGLPWALIFFSFASGTFQL 

GFLIFIWYWSMRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 

Important features: 
Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C- terminal targeting signal. 

amino acids 691-693 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 



FIGURE 205 

TGCCTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGAGGA 

AANCNTCGGGACTACACC^^^CAAGTGCACATGAACCTGCTGCTGGCCGTCTTC^^ 

AGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACA,GGCTCTGAAGGCTGGCTGCCGAGCCAGTGC 

ATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTAC 

CGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGC^ 

CTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCC 

TCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGAC 

TCCCTGGTCT^GCTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTC^ 




FIGURE 206 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCAGGTTTTGCTTTGATCC 

TTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAAAGTTTTGGATGGGATTATGTGGAAACTACCCTGCGATT 

CTCTGCTGCCAGAGCAGGCTCGGCGCTTCCACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGACTCGGGAGTC 

GCTGCTTCCAAAGTGCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAAATGAGCCTCTTGGGGCTTCTCCTGCTGA 

CATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAATTCCAGTTTTCCAGCAACA 

AGGAACA.G7UVCGGAGTACAAGATCCTCAGCATGAGAGAATTATTACTGTGTCTACTAATGGAAGTATTCACAGCCCAA 

GGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTAGAGGAAAATGTATGGATACAAC 

TTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGTATGATTTTGTAGAAGTTGAGGAAC 

CCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAGGAAAACAGATTTCTAAAGGAAATCAAA 

TTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGTTCTGCATCCACTACAACATTGTCATC 

AATTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTTAATAATGCTATAACTG 

CCTTTAGTACCTTGGAAGACCTTATTCGATATCTTGAACC7VGAGAGATGGC7VGTTGGACTTAGAA(^ 

CAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAAGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGA^ 

AGGTAAGATTATACT^GCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACC^ 

TCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGC/^TGAATGTCAAT 

GTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAT^GACCGGTGTCAGGGGATTGCACA 

AATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGG ATAG CCGC 

ATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAAC 

AGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAA 

AATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTAT 

TTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCC 

TTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATAT 

TCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACT 

TGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACA 

TTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTT 

CATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTT 

ACCAAAGGTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAAT 

TGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGT 

GCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATA 

ATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTC^ 

AGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCyVCCTTGA;^^ 

GGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTAOT 

TTAAACTCTGTTCCATACACTTGTATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTT 

ATTGTACTCTGGCAATTTAAAAGAAAATCAGTAAAATATTTTGCTTGTAATVATGCTTAATATNGT^ 

GGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGTGGCTATTTTGGGGAGAAAATTAAAAA 

AAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 



FIGURE 207 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERI ITVSTNGS IHSPRFPH 
TYPRNTVLWRLVAVEENWIQLTFDERFGLEDPEDDICKYDFVEVEEPS.DGTILGRWCGSGTVP 
GKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFST 
LEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLYSCTPRNFSVSIR 
EELKRTDT I FWPGCLLVKRCGGNCT^CCLHNOJJECQCVPSKVTKKYHEVLQLRPKTGVRGLHKSLT 
DVALEHHEECDCVCRGSTGG 



S Ignal s equenc e : 

amino acids 1-14 




FIGURE 208 

CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTGTGCTCTCTTCAACCAGACCTCTACATTCCATTTTGGAAGAAGA 
CTAAA AATG GTGTTTCCAATGTGGACACTGAAGAGACAAATTCTTATCCTTTTTAACATTVATCCTAATTTCCAAACTC 
CTTGGGGCTAGATGGTTTCCTAAAACTCTGCCCTGTGATGTCACTCTGGATGTTCCAAAGAACCATGTGATCGTGGAC 
TGCACAGACAAGCATTTGACAGAAATTCCTGGAGGTATTCCCACGAACACCACGAACCTCACCCTCACCATTAACCAC 
ATACCAGACATCTCCCCAGCGTCCTTTCACAGACTGGACCATCTGGTAGAGATCGATTTCAGATGCAACTGTGTACCT 
ATTCCACTGGGGTCAAAAAACAACATGTGCATCAAGAGGCTGCAGATTAAACCCAGAAGCTTTAGTGGACTCACTTAT 
TTAAAATCCCTTTACCTGGATGGAAACCAGCTACTAGAGATACCGCAGGGCCTCCCGCCTAGCTTACAGCTTCTCAGC 
CTTGAGGCCAACAACATCTTTTCCATCAGAAAAGAGAATCTAACAGAACTGGCCAACATAGAAATACXeTACCTGGGC 
CAAAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAGAAAGATGCCTTCCTAAACTTGACAAAGTTA 
AAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCTACTGTTTTGCCATCTACTTTAACAGAACTATATCTC 
TACAACAACATGATTGCy^AAAATCCT^GAAGATGATTTTAATAACCTCAACC^ 

AATTGCCCTCGTTGTTATAATGCCCCATTTCCTTGTGCGCCGTGTAAAT^TAATTCTCCCCTACAGATCCCTGTAAAT 

GCTTTTGATGCGCTGACAGAATT7UUVAGTTTTACGTCTACACAGTAACTCTCTTCAGCATGTC 

AAGAACATCAACAAACTCCAGGAACTGGAtCTGTCCCAAAACTTCTTGGCCTUVAGAAATTGG^ 

CATTTTCTCCCCAGCCTCATCCAATTGGATCTGTCTTTCAATTTTG7\ACTTCAGGTCTATCGTGCATCTATC 

TC^VCAAGCTVTTTTCTTCACTGAAAAGCCTGAAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGAAA^ 

AACCTCTCGCCATTACATAATCTTCAAAATCTTGAAGTTCTTGATCTTGGCACTAACTTTATAAAAATTGCTAACCTC 

AGCATGTTTAAACAATTTAAAAGACTGAAAGTCATAGATCTTTCAGTGAATAAAATATCACCTTCAGGAGATTCAAGT 

GAAGTTGGCTTCTGCTCAAATGCCAGAACTTCTGTAGAAAGTTATGAACCCCAGGTCCTGGAACAATTACATTATTTC 

AGATATGATAAGTATGCAAGGAGTTGCAGATTCAAAAACAAAGAGGCTTCTTTCATGTCTGTTAATGAAAGCTGCTAC 

AAGTATGGGCAGACCTTGGATCTTU^GTAAAAATAGTATATTTTTTGTCAAGTCCTCTGATTTTCAGCATCTTTCTTTC 

CTCAAATGCCTGAATCTGTCAGGAAATCTCATTAGCCAAACTCTTAATGGCAGTGAATTCCAACCTTTAGCAGAGCTG 

AGATATTTGGACTTCTCCAACAACCGGCTTGATTTACTCCATTCAACAGCATTTGAAGAGCTTCACAAACTGGAAGTT 

CTGGATATAAGCAGTAATAGCCATTATTTTCAATCAGAAGGAATTACTCATATGCTAAACTTTACCAAGAACCTAAAG 

GTTCTGCAGAAACTGATGATGAACGACAATGACATCTCTTCCTCCACCAGCAGGACCATGGAGAGTGAGTCTCTTAGA 

ACTCTGGAATTCAGAGGAAATCACTTAGATGTTTTATGGAGAGAAGGTGATAACAGATACTTACAATTATTCAAGAAT 

CTGCTAAAATTAGAGGAATTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGTATGCCT 

CCAAATCTAAAGAATCTCTCTTTGGCCAAAAATGGGCTCAAATCTTTCAGTTGGAAGAAACTCCAGTGTCTAAAGAAC 

CTGGAAACTTTGGACCTCAGCCACAACCAACTGACCACTGTCCCTGAGAGATTATCCAACTGTTCCAGAAGCCTC7VAG 

AATCTGATTCTTAAGAATAATCAAATCAGGAGTCTGACGAAGTATTTTCTACAAGATGCCTTCCAGTTGCGATATCTG 

GATCTCAGCTCAAATAAAATCCAGATGATCCAAAAGACCAGCTTCCCAGAAAATGTCCTCAACAATCTGAAGATGTTG 

CTTTTGCATCATAATCGGTTTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGGGTTAACCATACGGAGGTGACT 

ATTCCTTACCTGGCCACAGATGTGACTTGTGTGGGGCCAGGAGCACACAAGGGCCAAAGTGTGATCTCCCTG^ 

TACACCTGTGAGTTAGATCTGACTAACCTGATTCTGTTCTCACTTTCCATATCTGTATCTCTCTTTCTCATGGTGATG 

ATGACAGCAAGTCACCTCTATTTCTGGGATGTGTGGTATATTTACCATTTCTGTAAGGCCAAGATAAAGGGGTATCAG 

CGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTATGACACTAAAGACCCAGCTGTGACCGAGTGGGTT 

TTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAACATTTTAATTTATGTCTCGAGGAAAGGC^ 

CCAGGGCAGCCAGTTCTGGAAAACCTTTCC<^GAGCATACAGCTTAGCAAAAAGACAGTGTTTGTGATGAC^ 

TATGCAAAGACTGAAAATTTTAAGATAGCATTTTACTTGTCCCTVTCAGAGGCTCATGGATGAAAAAGT^ 

ATCTTGATATTTCTTGAGAAGCCCTTTCAGAAGTCCAAGTTCCTCCAGCTCCGGAAAAGGCTCTGTGGGAGTTCTGTC 

CTTGAGTGGCCAACAAACCCGCTU^GCTCACCCATACTTCTGGCT^GTGTCTAAAGAACGCCCTGGCC^ 

GTGGCCTATAGTCAGGTGTTCAAGGAAACGGT CTAG CGCTTCTTTGCAAAACACTACTGCCTAGTTTACC^ 

GCCTGGC 



FIGURE 209 



IWFPMWTLKRQI LILFNI I L I SKLLGARWFPKTLPCDVTLDVPKNHVI VDCTDKHLTE I PGGI PT 
NTTNLTLT I NH I PD I S PAS FHRLDHL VE I DFRCNCV^P I PLGS KlJJNMC I 

SLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYRNPCYVSYSI 

EKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYLYNNMIAKIQEDDFNNLNQLQIL^ 

PRCYNAPFPCAPCKNNSPLQIPWAFDALTELKVT^RLHSNSLQHVPPRWFKNINKLQELDLSQ 

LAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSLKILRIRGYVFKELKSFNL 

SPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKISPSGDSSEVGFCSNARTSVESY 

EPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQTLDLSKNSIFFVKSSDFQHLSFLK 

CLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLHSTAFEELHKLEVLDISSNSHYFQSEGI 

THMLNFTKNLK\n^QKLMMNDNDISSSTSRTMESESLRTLEFRGNHLDVLWREGDNRYLQLFKNLL 

KLEELDISKNSLSFLPSGVFDGMPPNLKNLSIiAKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVP 

ERLSNCSRSLKNLILKNNQIRSLTKYFLQDAFQLRYLDLSSNKIQMIQKTSFPENVLNNLKMLLL 

HHNRFLCTCDAWFVWWVNHTEVTIPYLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSL 

SISVSLFLiyr\mMTASHLYFWDVWYIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEW^ 

ELVAKLEDPREKHFNLCLEERDWLPGQPVLENIiSQSIQLSKKTVFVMTDKYAKTENFKIAFYLSH 

QRLMDEKVDVIILIFLEKPFQKSKFLQLRKRLCGSSVLEWPTNPQAHPYFWQCLKNALATDNHVA 

YSQVFKETV 

Signal sequence: 

amino acids 1-26 

Transmembrane domain: 

amino acids 840-860 




FIGURE 210 

GGGTACCATTCTGCGCTGCTGCAAGTTACGGAATGAAAAATTAGAACAACAGAAAC ATGG AAAACATGTTCCTTCAGT 

CGTCAATGCTGACCTGCATTTTCCTGCTAATATCTGGTTCCTGTGAGTTATGCGCCGAAGAAAATTTTTCTAGAAGCT 

ATCCTTGTGATGAGAAAAAGCAAAATGACTCAGTTATTGCAGAGTGCAGCAATCGTCGACTACAGGAAGTTCCCCAAA 

CGGTGGGCAAATATGTGACAGAACTAGACCTGTCTGATAATTTCATCACACACATAACGAATGAATCATTTCAAGGGC 

TGCAAAATCTCACTAAAATAAATCTAAACCACAACCCCAATGTACAGCACCAGAACGGAAATCCCGGTATACAATCAA 

ATGGCTTGAATATCACAGACGGGGCATTCCTCAACCTAAAAAACCTAAGGGAGTTACTGCTTGAAGACAACCAGTTAC 

CCCAAATACCCTCTGGTTTGCCAGAGTCTTTGACAGAACTTAGTCTAATTCAAAACAATATATACAACATAACTAAAG 

AGGGCATTTCAAGACTTATAAACTTGAAAAATCTCTATTTGGCCTGGAACTGCTATTTTAACAAAGTTTGCGAGAAAA 

CTAACATAGAAGATGGAGTATTTGAAACGCTGACAAATTTGGAGTTGCTATCACTATCTTTCAATTCTCTTTCACACG 

TGCCACCCAAACTGCCAAGCTCCCTACGCAAACTTTTTCTGAGCAACACCCAGATCAAATACATTAGTGAAGAAGATT 

TCAAGGGATTGATAAATTTAACATTACTAGATTTAAGCGGGAACTGTCCGAGGTGCTTCAATGCCCCATTTCCATGCG 

TGCCTTGTGATGGTGGTGCTTCAATTAATATAGATCGTTTTGCTTTTCAAAACTTGACCCAACTTCGATACCTAAACC 

TCTCTAGCACTTCCCTCAGGAAGATTAATGCTGCCTGGTTTAAAAATATGCCTCATCTGAAGGTGCTGGATCT^ 

TCAACTATTTAGTGGGAGAAATAGTCTCTGGGGCATTTTTAACGATGCTGCCCCGCTTAGAAATACTTGACTTC 

TTAACTATATAAAGGGGAGTTATCCACAGCATATTAATATTTCCAGAAACTTCTCTAAACTTTTGTCTCTACGGGC^ 

TGCATTTAAGAGGTTATGTGTTCCAGGAACTCAGAGAAGATGATTTCCAGCCCCTGATGCAGC 

CTATCT^CTTGGGTATTAATTTTATTAAGCAAATCGATITCAAACTTTTCCAAAATTTCTCCAATCTGG 

ACTTGTCAGAAAACAGAATATCACCGTTGGTAAAAGATACCCGGCAGAGTTATGOU^TAGTTCCTCTTT^ 

ATATCCGGAAACGACGCTCAACAGATTTTGAGTTTGACCCACT^TTCGAACTTTTATCATTTCACCCGTCCT^ 

AGCCACAATGTGCTGCTTATGGAAAAGCCTTAGATTTAAGCCTCAACAGTATTTTCTTCy^TTGGGCCAAACCT^ 

AAAATCTTCCTGACATTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCAAGTGTTAAGTGGAACTGAATTTTCAG 

CCATTCCTCATGTCAAATATTTGGATTTGACAAACAATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGAATTGT 

CCGACTTGGAAGTTCTAGATCTCAGCTATAATTCACACTATTTCAGAATAGCAGGCGTAACACATCATCTAGAATTTA 

TTCAAAATTTCACAAATCTAAAAGTTTTAAACTTGAGCCACAACAACATTTATACTTTAACAGATAAGTATAACCTGG 

AAAGCAAGTCCCTGGTAGAATTAGTTTTCAGTGGCAATCGCCTTGACATTTTGTGGAATGATGATGACAACAGGTATA 

TCTCCATTTTCAAAGGTCTCAAGAATCTGACACGTCTGGATTTATCCCTTAATAGGCTGAAGCACATCCCAAATGAAG 

CATTCCTTAATTTGCCAGCGAGTCTCACTGAACTACATATAAATGATAATATGTTAAAGTTTTTTAACTGGACATTAC 

TCCAGCAGTTTCCTCGTCTCGAGTTGCTTGACTTACGTGGAAACAAACTACTCTTTTTAACTGATAGCCTATCTGACT 

TTACATCTTCCCTTCGGACACTGCTGCTGAGTCATAACAGGATTTCCCACCTACCCTCTGGCTTTCTTTCTGAAGTCA 

GTAGTCTGAAGCACCTCGATTTAAGTTCCAATCTGCTAAAAACAATCAACAAATCCGCACTTGAAACTAAGACCACCA 

CCAAATTATCTATGTTGGAACTACACGGAAACCCCTTTGAATGCACCTGTGACATTGGAGATTTCCGAAGATGGATGG 

ATGAACATCTGAATGTCAAAATTCCCAGACTGGTAGATGTCATTTGTGCCAGTCCTGGGGATCAAAGAGGGAAGAGTA 

TTGTGAGTCTGGAGCTAACAACTTGTGTTTCAGATGTCACTGCAGTGATATTATTTTTCTTCACGTTCTTTATCACCA 

CCATGGTTATGTTGGCTGCCCTGGCTCACCATTTGTTTTACTGGGATGTTTGGTTTATATATAATGTGTGTTTAGCTA. 

AGGTAAAAGGCTACAGGTCTCTTTCCACATCCCAAACTTTCTATGATGCTTACATTTCTTATGACACCAAAGATGCCT 

CTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAGAGAGCCGAGACAAAAACGTTCTCCTTTGTCTAG 

AGGAGAGGGATTGGGACCCGGGATTGGCCATCATCGACAACCTCATGCAGAGCATCAACCAAAGCAAGAAAAC^ 

TTGTTTTAACCAAAAAATATGCAAAAAGCTGGAACTTTAAAACyVGCTTTTTACTTGGCTTTGCA 

AGAACATGGATGTGATTATATTTATCCTGCTGGAGCCTVGTGTTACAGCATTCTCAGTATTTGAGGCTACGGCAGCC^ 

TCTGTAAGAGCTCCATCCTCCT^GTGGCCTGACAACCCGAAGGCT^GAAGGCTTGTTTTGGCAAACTCTGAG^^ 

TCTTGACTGAAAATGATTCACGGTATAACAATATGTATGTCGATTCCATTAAGCAATACTAACTGACGTT 

ATTTCGCGCCATAATAAAGATGCAAAGGAATGACATTTCTGTATTAGTTATCTATTGiCTATGTAACAAA 

AACTTAGTGGTTTAAAACAACACATTTGCTGGCCCACAGTTTTTGAGGGTCAGGAGTCCAGGCCCAGCATAAC^ 

CCTCTGCTCAGGGTGTCTCAGAGGCTGCAATGTAGGTGTTCACCAGAGACATAGGCATCACTGGGGTCACACTCATGT 

GGTTGTTTTCTGGATTCAATTCCTCCTGGGCTATTGGCCAAAGGCTATACTCATGTAAGCCATGCGAGCCTCTCCCAC 

AAGGCAGCTTGCTTCATCAGAGCTAGCAAAAAAGAGAGGTTGCTAGCAAGATGAAGTCACAATCTTTTGTAATCGAAT 

CAAAAAAGTGATATCTCATCACTTTGGCCATATTCTATTTGTTAGAAGTAAACCACAGGTCCCACCAGCTCCATGGGA 

GTGACCACCTCAGTCCAGGGAAAACAGCTGAAGACCAAGATGGTGAGCTCTGATTGCTTCAGTTGGTCATCAACTATT 

TTCCCTTGACTGCTGTCCTGGGATGGCCTGCTATCTTGATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGG 

ACCATCTTAGCAGTTGACCTAACACATCTTCTTTTCAATATCTAAGAACTTTTGCCACTGTGACTAATGGTCCTAATA 

TTAAGCTGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCGGTTTTATTTA 

CAGTTGCTTTTACAAATATTTGCTGTAACATTTGACTTCT7UVGGTTTAGATGCCATTTAAGAACTGAGATGGATAGCT 

TTTAAAGCATCTTTTACTTCTTACCATTTTTTAAAAGTATGCAGCTAAATTCGAAGCTTTTGGTCTATATTGTTAATT 

GCCATTGCTGTAAATCTTAAAATGAATGAATAAAAATGTTTCATTTTACAA7UVAAAAAAAAAAAA 



FIGURE 211 

MENMFLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQiroSVIAECSNRRLQEVPQTVGKW 

TELDLSDNF I TH I TNES FQGLQNLTKINLNHNPNVQHQNGNPGI QSNGLN ITDGAFLNLKNLREL 

LLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYIiAWNCYFNKVCEKTNIEDG 

VFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINLTLLDLSGNCPRC 

FNAPFPCVPCTOGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNMPHLKVLDLEFNYLVG 

EIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRALHLRGYVFQELREDDFQPL 

MQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLVKDTRQSYANSSSFQRHIRKRR 

STDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFIGPNQFENLPDIACLNLSANSNAQV 

LSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEVLDLSYNSHYFRIAGVTHHLEFIQNFTN 

LKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRLDILWNDDDNRYISIFKGLKNLTRLDLSLNR 

LKHIPNEAFLNLPASLTELHINDNMLKFFNWTLLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLR 

TLLLSHNRISHLPSGFLSEVSSLKHLDLSSNLLKTINKSALETKTTTKLSMLELHGNPFECTCDI 

GDFRRWMDEHLNVKIPRLVDVIGASPGDQRGKSIVSLELTTCVSpVTAVILFFFTFFITTiyr^ 

AliAHHLFYWDWFIYNVCLAKVKGYRSLSTSQTFYDAYISYDTKDASWDWINELRYH 

KNVLLCLEERDWDPGLAI IDNLMQS INQSKKTVFVLTKKYAKSWNFKTAFYIiALQRLMDENMDVI 

I F I LLE PVLQHSQ YLRLRQR I CKS S I LQWPDNPKAEGL FWQTLRNVVLTENDSRYNNM YVDS I KQ 

Y 

Signal sequence: 

amino acids 1-26 

Transmembrane domain: 

amino acids 826-848 



FIGURE 212 

CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCTCGA 

CCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCT^GGGAGGCTCCTGTGGACAGG^ 

AGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAG 

GGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACO^TGGCCACGCCTGGGCTCC^ 

AGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCT^CCACCCGGAGGAGCAG^ 

GTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCC^ 

CACT^GGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 

ACAGAGCACGCCTACCGGCGCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTC 

CGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCA 

GCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCT 

CGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAAT 

ATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAG 

GATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCC 

CAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGC 

AGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGG 

ACAGTGCAATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTG 

CAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCC. 

CGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTT 

CCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGATU^GACTCGTGACTGCCCAG 

GGCTQGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCC 

AGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCT 

TCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCT 

GGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAG 

GTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCAa^GGCCAGGCAGCCC^ 

GCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCTGCAG^ 

GGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAAT 




FIGURE 213 

MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCIX3H 

RTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCRCPAGWR 

GDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVAPNPTGVDSA 

MKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLGRIDSLSEQISFL 

EEQLGSCSCKKDS 



Signal sequence : 

1-19 



FIGURE 214 



GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCT^GGCGGCCAGTGGGCC 

TAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCT^CCATGGCCACGCCTGGGCT 

ATC7VGAGCAGCCCCTGTGGTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGAGGGGCTTC 

CGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGCAGTGGATGAGCA 

ACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCCAAAGCCACATCTGTA 

GCCAGGATGAGCAGTGTGAATCC?^GGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCA 

CCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGA 

GGAGAAGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGG 

CTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 

CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCA 

CCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGC 

CCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCT 

TCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGC 

CTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGC 

AGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTG 

TTGGGAGGGGCT^CAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGC^ 

TGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGT^GTGCAGAGGCTGC^ 

GTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCT 

ACTGGAGO^TGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGqVGCTCGGCCGCA 

TCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAA 

GACTC GTGA CTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGC 

CCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCC^ 

CCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTG^ 

GATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGra 

AGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACA^ 

CAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGC^ 

ATGAAACGTG 




FIGURE 215 

MRGSQEVLIiMWLLVLAVGGTEHAYRPGRRVCAVRMGDP^^ 

RTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCRCPAGWR 
GDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVAPNPTGVDSA 
MKEEVQRLQSRVDLLEEKLQLVLAPLHSIiASQALEHGLPDPGSLLVHSFQQLGRIDSLSEQISFL 
EEQLGSCSCKKDS 



Signal sequence: 

1-19 



FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGKSCTCCTGTGGACAGGCCA^ 

GGGCCTCAGGAGGTGCCTCCTVGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAC^ 

TCCAGTCCCAGGACACAGCAGCGGCCACCTVTGGCCACGCCTGGGCTCCAGC^ 

CCAGGACCGGGGAGGCACT^GGTGGCCCCCACaVCCCGGAQGAGCAGCTCCTGCCCOT 

GATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCC 

ATGA GGGGCTCTCAGGAGGTGCTGCTQATCTGGCTTCTGGTGTTGGCT^GTGGGCGGCACAGAG^ 

CGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGT 

TCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 

CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGC 

GTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGC 

CGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGG 

GGTGACACTTGCCT^GTCT^GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTG 

CGTCAACACCGCCGGCAGTTACTGGTGCC72^GTGTTGGGAGGGGCACAGCCTGTCTGCA.GACG^ 

CACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACT^ 

ATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGT 

GCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCA 

TCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGA^ 

GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTC GTGA CTGCCCAGCGCCCCAGGCTGGAC 

TGAGCCCCTCACGCCGCCCTGCAGCCCCC7VTGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCA 

CCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGG 

AGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCC 

CCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAG 

TCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTC^ 
GGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTG 




FIGURE 217 

MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRACSTY 
RTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCRCPAGWR 
GDTCQSDVDECSARRGGCPQRCWTAGSYWCQCWEGHSLSAIX5TLCVPKGGPPRVAPNPTGVDSA 
MKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLGRIDSLSEQISFL 
EEQLGSCSCKKDS 



Signal sequence: 

1-19 




FIGURE 218 

GGTTGCCACAGCTGGTTTAGGGCCCCGACCACTGGGGCCCCTTGTCAGGAGGAGACAGCCTCCCGGCCCGGGGAGGAC 

AAGTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTCCGTTTCCTGCCGTCAGCTGCCGGCCGAGTTGG 

GTCTCCGTGTTTCAGGCCGGCTCCCCCTTCCTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGGAGATTGTCT 

TCCAGGGCTAGCAATTGGACTTTTGATGATGTTTGACCCAGCGGCAGGAATAGCAGGCAACGTGATTTCAAAGCTGGG 

CTCAGCCTCTGTTTCTTCTCTCGTGTAATCGCAAAACCCATTTTGGAGCAGGAATTCCAAT CATG TCTGTGATGGTGG 

TGAGAAAGAAGGTGACACGGAAATGGGAGAAACTCCCAGGCAGGAACACCTTTTGCTGTGATGGCCGCGTCATGATGG 

CCCGGCAAAAGGGCATTTTCTACCTGACCCTTTTCCTCATCCTGGGGACATGTACACTCTTCTTCGCCTTTGAGTGCC 

GCTACCTGGCTGTTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCTTCCTTTTCTCCATGGCTACACTGT 

TGAGGACCAGClTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATGAAGaVGCTTTCATAGAAATGGAGA 

AAGCTACCAATGGTGCGGTGCCCCAGGGCCAGCGACCACCGCCTCGTATCAAGAATTTCCAGATAAACAACCAGATTG 

TGAAACTGAAATACTGTTACACATGCAAGATCTTCCGGCCTCCCCGGGCCTCCCATTGCAGCATCTGTGACAACTGTG 

TGGAGCGCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTACCGCTACTTCTACCTCTTC^ 

TCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCTTCAACATCGTCTATGTCGCCCTCAAATCTTTG^ 

GCTTCTTGGAGACATTGAAAGAAACTCCTGGAACTGTTCTAGAAGTCCTCATTTGCTTCTTTA^ 

TGGGACTGACTGGATTTCATACTTTCCTCGTGGCTCTCAACCAGACAACCAATGAAGACATCAAAGGATCATGGACAG 

GGAAGAATCGCGTCCAGAATCCCTACAGCCATGGCAATATTGTGAAGAACTGCTGTGAAGTGCTGTGTGGCCCCTTGC 

CCCCCAGTGTGCTGGATCGAAGGGGTATTTTGCCACTGGAGGAAAGTGGAAGTCGACCTCCCAGTACTCAAGAGACCA 

GTAGCAGCCTCTTGCCACAGAGCCCAGCCCCCACAGAACACCTGAACTCAAATGAGATGCCGGAGGACAGCAGCACTC 

CCGAAGAGATGCCACCTCCAGAGCCCCCAGAGCCACCACAGGAGGCAGCTGAAGCTGAGAAG TAG CCTATCTATGGAA 

GAGACTTTTGTTTGTGTTTAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAGAGCAAGTAAG 

CTGTCCCTTTTAACTGTTTTTCTTTGGTCTTTAGTCACCCAGTTGCACACTGGCATTTTCTTGCTGCAAGCTTTTTTA 

AATTTCTGAACTCAAGGCAGTGGCAGAAGATGTCAGTCACCTCTGATAACTGGAAAAATGGGTCTCTTGGGCCGTGGC 

ACTGGTTCTCCATGGCCTCAGCCACAGGGTCCCCTTGGACCCCCTCTCTTCCCTCCAGATCCCAGCCCTCCTGCTTGG 

GGTCACTGGTCTCATTCTG|3GGCTAAAAGTTTTTGAGACTGGCTCAAATCCTCCCAAGCTGCTGCACGTGCTGAGTCC 

AGAGGCAGTCACAGAGACCTCTGGCCAGGGGATCCTAACTGGGTTCTTGGGGTCTTCAGGACTGAAGAGGAGGGAGAG 

TGGGGTCAGAAGATTCTCCTGGCCACCAAGTGCCAGCATTGCCCACAAATCCTTTTAGGAATGGGACAGGTACCTTCC- 

ACTTGTTGTANNNNNN^^^NNNNNNNNN^^ 

TGGCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACTTAAACACTATCCCC 
TCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGGAAACCAAGGCACAGAGAGGCTCTCCTCTCC 
TCTCCTCTCCCCCGATGTACCCTCAAAAAAAAAAAAATGCTAACCAGTTCTTCCATTAAGCCTCGGCTGAGTGAGGGA 
AAGCCCAGCACTGCTGCCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCTATGGTAACCACACTGGGG 
GCTTCCTCCAAGCCCCGCTCTTCCAGCACTTCCACCGGCAGAGTCCCAGAGCCACTTCACCCTGGGGGTGGGCTGTGG 
CCCCCAGTCAGCTCTGCTCAGGACCTGCTCTATTTCAGGGAAGAAGATTTATGTATTATATGTGGCTATATTTCCTAG 
AGCACCTGTGTTTTCCTCTTTCTAAGCCAGGGTCCTGTCTGGATGACTTATGCGGTGGGGGAGTGTAAACCGGAACT^ 
TTCATCTATTTGAAGGCGATTAAACTGTGTCTAATGCA 



FIGURE 219 

MSVMVWKKVTRKWEKLPGRNTFCCDGRVMMZ^QKGIFYLTLFLILGTCTLFFAF 

PAI PVFAAMLFLFSMATLLRTS FS DPGVI PRALPDEAAF I EME I EATNGAVPQGQRP P PR I KNFQ 

INNQ I VKLKYCYTCKI FRPPRASHCS I CDNCVERFDHHCPWVGNCVGKRNYRYFYLF I LSLSLLT 

lYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTFLVALNQTTNEDI 

KGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSRPPSTQETSSSLLPQS 

PAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 



Putative transmembrane domains : 

amino acids 36-55 (type II TM) , 65-84, 188-208, 229-245 




FIGURE 220 

AAAACCCTGTATTTTTTACAATGKTAAATAGAC^TNANCCTGGAGGTCTTTGAAT^ 
AGGGATGGTGGGGTTGATTTTTOTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCTCC^ 
GAGCNCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTACCGCTAOT 
CTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCAACATCGT 




FIGURE 221 

GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCAC^ 

ATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGAAAAAAA 

AT CATGA AAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTC^ 

TGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAQATGCCACCTTCCCCAAAGCTAT^ 

ACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATTGACAACCGGGTC^ 

CGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGACAAGTGGTGCCT^ 

TCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCGAGATCCAGAACGTGGATGTGT 

ATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAACCACCCAAAGACCTCTAGGGTCCAC 

CTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTCTTCAGATATCTCCATTAATGAAGGGAA 

CA?VTATTAGCCTCACCTGCATAGCAACTGGTAGACCAGAGCCTACGGTTACTTGGAG 

CTCCCAAAGCGGTTGGCTTTGTGAGTGAAGACGAATACTTGGAAATTCAGGGCATCACCCGGGAG 

CAGTCAGGGGACTACGAGTGCAGTGCCTCCAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAA 

GGTCACCGTGAACTATCCACCA.TACATTTCAGAAGCCAAGGGTACAGGTGTCCCCGTG^ 

AGGGGACACTGCAGTGTGAAGCCTCAGCAGTCCCCTCAGCAGAATTCCAGTGGTACAAGGATGAC 

AAAAGACTGATTGAAGGAAAGAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCAT 

CTTCTTCAATGTCTCTGAACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCC 

ACACCAATGCCAGCATCATGCTATTTGGTCCy^GGCGCCGTCAGCGAGGTGAGC^ 

AGGAGGGCAGGCTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTTTC 

TGAGTGCCACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAA^^ 

AACACCGACAGCAACCAATCAGATATATACAAATGAAATTAGAAGAAACACAGCCTC^ 

GAAATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAA^ 

ATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACAC^ 

CCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAAGGGCTC^ 

GCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCAAATTCAATCA 

GTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGGGCACTTTGGTAGA 

CTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAAAAAAA 




FIGURE 222 

MKTIQPKMHNSISWAIFTGLAALCLFQGVPWSGDATFPKAMDNVTWQGESATLRCTIDN^ 

VAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPKTSRVHL 

IVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYLEIQGITREQ 

SGDYECSASNDVAAPVWRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAVPSAEFQWYKDDK 

RL I EGKKGVKVENRPFLS KL I FFNVSEHDYGNYTCVASNKLGHTNAS IMLFGPGAVSEVSNGTSR 

RAGCVWLLPLLVLHLLLKF 



Signal peptide: 
amino acids 1-28 



FIGURE 223 

GAAAAAAAATGATGAAAACCATCCAGCCAA2\AATGCACAATTCTATCTCT 

GGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCA.GCGGAGATGCCACCTTCCCC7^ 
AGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATTG^ 
GGGTCACCCGGGTGGCCTGGCTAAACCGCAiSCTVCCATCCTCTATGCTGGGAATGAC^ 
CTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCTVAACGCT^GTACAGCATCGAGATCCA^ 
GGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAACCACCCAAAGACCTCTA 
GGGTCCACCTCATTGTGC^GTATCTCCCAAAATTGTAGAGATTTCTTCA 
' GAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGACCAGAG 



FIGURE 224 



ATGGCTGGTGACGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCT^ 
ACCGCCTGGCCCGACTCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGGG 
ACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCTGCACTGCTT 
CTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCATCCCACAGCACCTGCCTTACAGAGGCCTGCATT 
CGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGG 
GGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAACCAG 
GCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCAGAAGACACAGCGCTTCTAC 
CTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCCAGCCACTGAGAGACCTCATTGAGAAGATTC^^ 
TGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACA 
CCATTCTTCACCGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTC 
TTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTGCTCACTGCCTATCTGGATTAGATG^^ 
GAACTGGGGATGCTGCTGGGTGGGCGGCCCACCTCCACGAGGGAGCAGATGCAGCAGGTGCTGGAGTTGGAGATACAG 
CTGGCCAACATCACAGTGCCCCACSGACCAGCGGCGCGACGAGGAGAAGATCTACCACAAGA^^^ 
CAGGCTCTGGCGCCCTCCATGGACTGGCTTGAGTTCCTGTCTTTCTTGCTGTCACCATTGGAGTTGAGTGA 
CCTGTGGTGGTGTATGGGATGGATTATTTGCAGCAGGTGTCAGAGCTCATCAACCGCACGGAACCAAGCATC^ 
AATTACCTGATCTGGAACCTGGTGCAAAAGACAACCTCT^GCCTGGACCGACGCTTTGAGTCTGCACAAGAGAAGCTC 
CTGGAGACCCTCTATGGCACTAAGAAGTCCTGTGTGCCGAGGTGGCAGACCTGCATCTCCAACTVCGGATGACGCCCT^ 
GGCTTTGCTTTGGGGTCACTCTTCGTGAAGGCCACGTTTGACCGGCAAAGCAAAGAAATTGCAGAGGGGATGATCAGC 
GAAATCCGGACCGCATTTGAGGAGGCCCTGGGACAGCTGGTTTGGATGGATGAGAAGACCCGCCAGGCAGCCAAGGAG 
AAAGCAGATGCCATCTATGATATGATTGGTTTCCCAGACTTTATCCTGGAGCCCAAAGAGCTGGATGATGTTTATGAC 
GGGTACGAAATTTCTGAAGATTCTTTCTTCCAAAACATGTTGAATTTGTACAACTTCTCTGCCAAGGTTATGGCTGAC 
CAGCTCCGCAAGCCTCCCAGCCGAGACCAGTGGAGCATGACCCCCCAGACAGTGAATGCCTACTACCTTCCAACTAAG 
AATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCCTTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGT 
GGCATCGGTGTGGTCATGGGCCATGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAAC 
CTGCGGCCCTGGTGGdAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAGTACAATCAA 
TACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGT^CATTACTGACAACGGGGGGCTGAAGGCT 
GCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATGGGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCAC 
CAGCTCTTCTTCGTGGGATTTGCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACC 
GACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGCACTTCGGCTGC 
CCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACCTGGATCAGGGGAGAAATGGCCAGCTGT 
CACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGT 
■CTAGTCCCTCCCCCCCACAGGTGACaVTGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGG^ 
CCTGCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGA^^ 
GGCCAGTTCCCATAGGAAGGAGTCTGCC 



FIGURE 225 

MNVALQEl^GSNVGFQKGTRQLLGSRTQLELVLAGASLLLAALLLGCLVALGVQYHRDPSHSTC 
LTEACIRVAGKILESLDRGVSPCEDFYQPSCGGWIRRNPLPDGRSRWNTFNSLWDQNQAILKHLL 
ENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDNFMEVLKAVA 
GTYRATPFFTVYISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEKVLTAYLDYMEELGMLLG 
GRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAPSMDWLEFLSFLLSPL 
ELSDSEPVWYGMDYLQQVSELINRTEPSILNNYLIVINLVQKTTSSLDRRFESAQEKLLETLYGT 
KKSCVPRWQTCISNTDDALGFALGSLFVKATFDRQSKEIAEGMISEIRTAFEEALGQLVWMDEKT 
RQAAKEKADAIYDMIGFPDFILEPKELDDVYDGYEISEDSFFQNMLNLYNFSAKVMADQLRKPPS 
RDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNHPKALNFGGIGWMGHELTHAFDDQGRE 
YDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQVNGERLNGRQTLGENITDNGGLKAAYNAYKA 
WLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCSVRTPESSHEGLVTDPHSPARFRVLGTLSNSRDF 

LRHFGCPVGS PMNPGQLCEVW 

Type II Transmeinbrsuie domain: 

amino acids 32-57 



FIGURE 226 



GCCCGGCCCTCCGCCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTCCCTCCCTCCTCCCCAG 

CTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCGCTGCTGCTCCTCGGGCTGCTGCTGCTCGGCTC 

CCGGCCGGCCCGCGGCGCCGGCCCAGAGCCCCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCTGCCCGTTCGGGG 

AGCGGCAGGTAGGTGGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGCCAGCCCGGAGG^ 

GGCGCGGGGCGCAGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAGGGCGCGGTGCCTGGGACC 

CGGGACCCGCGGGCAGCCCCCGGGGCGGCACACGGCGCGAGCTGGGCAGCGGCCTCCAGCCAAGCCCGTCCCCGCAGG 

CTGCACCTTCGGCGGGAAGGTCTATGCCTTGGACGAGACGTGGCACCCGGACCTAGGGGAGCCATTCGGGGTGATGCG 

CTGCGTGCTGTGCGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGGCAGGGTCAGCTGCAAGAACATCAA 

ACCAGAGTGCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCTGCCGGGACACTGCTGCCAGACCTGCCCCCAGGACTT 

CGTGGCGCTGCTGACAGGGCCGAGGTCGCAGGCGGTGGCACGAGCCCGAGTCTCGCTGCTGCGCTCTAGCCTCCGCTT 

CTCTATCTCCTACAGGCGGCTGGACCGCCCTACCAGGATCGGCTTCTCAGACTCCAATGGCAGTGTCCTGTTTGAGCA 

CCCTGCAGCCCCCACCCAAGATGGCCTGGTCTGTGGGGTGTGGCGGGCAGTGCCTCGGTTGTCTCTGCGGCTCCTTAG 

GGCAGAACAGCTGCATGTGGCACTTGTGACACTCACTCACCCTTCAGGGGAGGTCTGGGGGCCTCTC^^ 

GGCCCTGTCCCCAGAGACCTTCAGTGCCATCCTGACTCTAGAAGGCCCCCACCAGCAGGGCGTAGGGGGCATCACCCT 

GCTCACTCTCAGTGACACAGAGGACTCCTTGCATTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGT^ 

CTTGAGGCTCCAGATTCTACACCAGGGGCAGCTACTGCGAGAACTTCAGGCCAATGTCTCAGCCCAGGAACCAGGCTT 

TGCTGAGGTGCTGCCCAACCTGACAGTCCAGGAGATGGACTGGCTGGTGCTGGGGGAGCTGCAGATGGCCCTGGAGTG 

GGCAGGCAGGCCAGGGCTGCGCATCAGTGGACACATTGCTGCCAGGAAGAGCTGCGACGTCCTGCAAAGTGTCCTTTG 

TGGGGCTAATGCCCTGATCCCAGTCCAAACGGGTGCTGCCGGCTCAGCCAGCCTCACTCTGCTAGGAAATGGCaJJCCCT 

GATCCTCCAGGTGCAATTGGTAGGGACAACCAGTGAGGTGGTGGCCATGACACTGGAAACCAAGCCTCAGCGGAGGGA 

TCAGCCCACTGTCCTGTGCCACATGGCTGGCCTATCCTCCCCTGCCCCCAGGCCGTGGGTATCTGCCCTGGGCTGGGG 

TGCCCGAGGGGCTCATATGCTGCTGCAGAATGAGCTCTTCCTGAACGTGGGCACCAAGGACTTCCCAGACGGAGAGCT 

TCGGGGGCAACGTGGCTGCCCTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGT 

GCTACCCCCTGTGAAGAGCCAAGCAGCAGGGCACGCCTGGCTTTCCTTGGATACCCACTGTCACCTGCACTATGAAGT 

GCTGCTGGCTGGGCTTGGTGGCTCAGAACAAGGCACTGTCACTGCCCACCTCCTTGGGCCTCCTGGAACGCCAGGGCC 

TCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAGGCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCGGCA 

CCTGGCAAAAGGCATGGCTTCCCTGATGATCACCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGCAGCCTCTCCTC 

CCAGGTGCACATAGCCAACCAATGTGAGGTTGGCGGACTGCGCCTGGAGGCGGCCGGGGCCGAGGGGGTGCGGGCGCT 

GGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCTCCCGGCCCTAGCGCCCGCCAAACCTGGTGG 

TCCTGGGCGGCCCCGAGACCCCAACACATGCTTCTTCGAGGGGCAGCAGCGCCCCCACGGGGCTCGCTGGGCGCCCAA 

CTACGACCCGCTCTGCTCACTCTGCACCTGCCAGAGACGAACGGTGATCTGTGACCCGGTGGTGTGCCCACCGCCCAG 

CTGCCCACACCCGGTGCAGGCTCCCGACCAGTGCTGCCCTGTTTGCCCTGGCTGCTATTTTGATGGTGACCGGAGCTG 

GCGGGCAGCGGGTACGCGGTGGCACCCCGTTGTGCCCCCCTTTGGCTTAATTAAGTGTGCTGTCTGCACCTGCAAGCA 

GGGGGGCACTGGAGAGGTGCACTGTGAGAAGGTGCAGTGTCCCCGGCTGGCCTGTGCCCAGCCTGTGCGTGTCAACCC 

CACCGACTGCTGCAAACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGGGGACCCCATGCAGGCTGATGGGCCCCGGGG 

CTGCCGTTTTGCTGGGCAGTGGTTCCCAGAGAGTCAGAGCTGGCACCCCTCAGTGCCCCCGTTTGGAGAGATGAGCTG 

TATCACCTGCAGATGTGGGGTAAGTGGGGAGCAGAGGCTTGTGTGAGGTGGGTACTGGGAGCCTGGTCTGGAGTAGGG 

AGACCTTCCCAGGGAGGTCCCTGAAGAAGCTGAAGGTCACTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCT 

CTGTCTTGTACCAGGCAGGGGTGCCTCACTGTGAGCGGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGGAAGG 

AGAGTCGATGCTGTTCCCGCTGCACGGCCCACCGGCGGCGTAAGTGAGGGAGTCCAGGGTCAGCAGCTGTGAGTGGAG 

GGCTCACCTGCCTGTGGGACTCCTGATCAGGGAAGGGAGCACTCACTGTGTGCAGGAACAGTGCAGCCTGCCTCACAA 

GTGCCATTCCAATCCACCCTCACAGCAACCTGGTGGAATTGTTATTTATGACCTTTTCTTTACAAATGAGATTTCTGA 

AGCTCAGAGAAATTAAGCAACGAGATGAAGGTCACCCAGCTGTGTGCACTGACCTGTTTAGAAAATACTGGCCTTTCT 

GGGACCAAGGCAGGGATGCTTTGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCACTCCCTCTCCCCTCCTCCAACAT 

TCCCTCCCTTCTGTCTCCAGCAGCCCCAGAGACCAGAAGTGATCCAGAGCTGGAGAAAGAAGCCGAAGGCTCTTAGGG 

AGCAGCCAGAGGGCCAAGTGACCAAGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTC 

TCCTGTGGGAAGCCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAACCACAGCTC 

CACAAGGGGGAGAGGCAGCTGGGCCAGACCGAGGTCACAGCCACTCCAAGTCCTGCCCTGCCACCCTCGGCCTCTGTC 

CTGGAAGCCCCACCCCTTTCTTCCTGTACATAATGTCACTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACC 

AAGGGCCCCGGACACTCCACTCCTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAAC 

ATTTCTTTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGGAGAAGGGGCN 

GAGAGTAGGAGGTGAGAGAGAGGAGCTCTGACACTTGGGGAGCTGAAAGAGACCTGGAGAGGCAGAGGATAGCGTGGC 

NNTTGGCTGGCATNCCTGGGTTCCGCAGAGGGGCTGGGGATGGTTCTTGAGATGGTCTAGAGACTCAAGAATTTAGGG 

AAGTAGAAGCAGGATTTTGACTCAAGTTTAGTTTCCCACATCGCTGGCCTGTTTGCTGACTTCATGTTTGAAGTTGCT 

CCAGAGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCTTTCTTTCCCTCCCCTCCCC 

TCCCCTCCCCTCCCCTCC 



FIGURE 227 

CMCCGAGCCK3<K3GTGCTGCGCCK3CGGCCGTGATGGCTGGTGACC3^ 

CXSGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGACTCC^^ 

ATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGGGACA^^ 

GGTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCTGC^^ 

TGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCATCCCACAGCACCTGC 

CTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCC 

CTGT6AGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGC 

GTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAACCAGGCCATACTGAAGCACCTGCTT 

GAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCAGAAGACACAGCGCTTCTACCTATCTTG 

CCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCCAGCCACTGAGAGACCTCATTGAGAAGATTG 

GTGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTGAAGGCAGTAGCA 

GGGACCTACAGGGCCACCCCATTCTTCACCGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAG 

CAATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAA 

CTGCCAATGAGAAAGTAAGGAACATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTG 

ATCCCTGTTGACTTTTCCCTTTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTA 

GTGAACAAACTGCCCCTCCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTC^^^^ 

CCTTCCTTCCTTCCTCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGT 

GGGGGAATTCGGAGATATAAGTTTCCGAGCCATTGCCAaVGGAAGCGTTCAGTGTCGAT^ 

ATGGACCTAGATAGGCTGATAACAAAGCTCACAAGAGGGTCCTGAGGATTCAGGAGAGACTT^^^ 

GAGCCAGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 



FIGURE 228 

ATGCCTACTACCTTCCAACTAAGT^TGAGATCGTCTTCCCCGCTGGCATCCTGCAGG^^ 

TATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA^^^ 

GACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGCCCT^ 

AGAATGAGTCCCTGGCAGCCTrCCGGAACCACACXKK:CTGCAT^GGAACAGTACAATCAA^^^ 

CAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACATTGCTGACAA^ 

GCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATGGGGAGGAGCAGC^^ 

CAGCCGTGGGGCTCACCAACCACCAGCTCrrCTTCGTGGGATTTGCCCAGGTGT^^^^ 

CGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGT 

GCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGCACTTCGGCTGCCCTGTCGGCTCCCCCA 

TGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACCTGGATCAGGGGAGAAATGGCCAGCTGTCAC 

CAGACCTGGGGCAGCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAAATGCA^^ 

TGGGCTGGGTCTAGTCCCTCCCCCCCACAGGTGACATGAGTACAGACCCTCCTCAATCACCACAT 

TGTGCCTCTGCTTTGGGGGTGCCCCTGCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTT 

CCGTGTCACCCTGCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCT 

CTTGTGTCCCCAGGCTCACTCAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTG'KSACC 

CACAGGCCTGGGTGGTGTACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTG 

GACTCAGCTCTGTCTGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCC^^ 

TGCTCCCAGTGCTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAA6CCCAAGGGCCTCTGAAAG 

CCTCCTGCTGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTT 

CCTGTGTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGA^ 

TAGAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTOT 

GGCCCTTATAGGACC 



FIGURE 229 



GGGACACGGCGGTGCTTACK3TGTTATTTGGAAGATGGAGCTTCAAAGGGTGC 

A.TAC 
CAGA 
TGAC 
ATTT 

"GCGGAAAATGCTGTGTCATTCCCAGATGTGAaH^KKOTAAAAGTTGTTGT^^ 



TCAAGTATTA?T™cSGaiS^TSATAAGT^^^ 

GACTACAGCCl 
GACTCAACAT-f 
ACATCTCAAAT 



GAATTGCTCATTATGTAAATACTTTAATTC 
CAGTACACATTGTCCTTTTTTTAAGACX5TC 

TGATTTCATGTTTGTAATCTACAACTTTTC.*.^ ^^^^^^^ TvoiTvr.RTvnr'aTTr 

^^SSaou^cgaatattgcagtgartstgtgattc^ 

AGTTCCCTGrrTCAATfiAGRGTCA&TCCaCATOTRCaM 



TTGTGATGAGAGACAATTTAATAACAGTAT 



SSS^^aSSScStSttttttttaaattttttctgcctat^ 

AAT 



TATGGGACGTTTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 230 

MMLLVQGACCSNQWIJ^VLLSLCCLLPSCLPAGQSVDFPWJ^VDNMMVRKGDTAVLRCYLED^^ 
KGAWLNRSSIIFAGGDKWSVDPRVSISTLNKRDYSLQIQNVDVTDDGPYTCSVQTQHTPRTMQVH 
LTVQVPPKIYDISNDMTVNEGTNVTLTCLATGKPEPSISWRHISPSAKPFENGQYLDIYGITRDQ 
AGEYECSAENAVSFPDWKVKVVVNFAPTIQEIKSGTVTPGRSGLIRCEGAGVPPPAFEWYKGEK 
KLFNGQQGIIIQNFSTRSILTVTOVTQEHFGNYTCVAANKLGTTmSLPLNPPSTAQYGITGSA^ 

VLFSCWYLVLTLSSFTSIFYLKNAILQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-31 

Transmembrsuie domain :~ 

amino acids 326-345 

N-glycosylation sites. 

amino acids 71-75, 153-157, 273-277, 284-288, 292-296, 305-309 

Casein kinase II phosphorylation site. 

amino acids 147-151, 208-212, 224-228 

Tyrosine kinase phosphorylation site. 

amino acids 178-186 

N-myristoylation sites. 

-7 -lo ^-x in 67-73 151-157, 239-245, 291-297, 
amino acids 7-13, 63-70, 6/ /J, ^ox x^/, 

302-308, 319-325 



Myelin PC protein: 

amino acids 92-121 



FIGURE 231 



AGTGGTTCGATGGGAAGGATCTTTCTCCAAGTGGTTCCTCTTGAG^ 
CAGGACTTTGCKZaVTCrATAAAGCTTGGCAATGAGAAATAACa^ 
CTTGAGTGAGACCCAACAAGCTGCITTTCACCAAATTGCAATGGAGCCTTTCGA^ 
CAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCT^^^ 

CTCACCGCTGGCGCTCGGCTGCTGGTGGTCCAAGITCTGAATCTGCAGGCGCGG^^ 

GGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTCTCCTTGCTG^^^ 

CAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTGCAAGTCCTGCAGGCCCAACTC 

ACCTGGGTCCGCGTCAGCCATCAGCACTTGCTGCAGCGGGTAGACAACTTCACTCAGAACCCAGG 

GATGTTCAGAATCAAAGGTGAACAAGGCGCCCC^GGTCTTCAAGGTCACAAGGGGGCC^^ 

TGCCTGGTGCCCCTGGCCCGCCGGGACCACCTGCTGAGAAGGGAGCCAAGGGGGCTATGGGACGA 

GATGGAGCAACAGGCCCCTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCA 

AGGACCCCAGGGTGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGG 

GCAGCAAAGGCGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAG^ 

GACCTGGGTCTCCCAGGAAGCAAAGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATCGGGCC 

TCCTGGAGCCCAGGGGAGTAAAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTC 

CTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTG 

GGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCeCTGGGCGAGCAGGACITCCAGG 

GAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGACACAGGACTTCAAGGAC 

AGCAAGGAAGAAAAGGAGAATCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGC 

CCAGGGCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAA 

AGGATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCCG 

TeAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACA 

ATTTGCGATCACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGG^^ 
AGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAATGTTCAGTGTC 

GGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCATGACTGCAG^ 

GAGGAGGACGCAGGCGTGGAGTGCAGCGTCTGACCCGGAAACCCTTTCACTTCTCTGCTCCC^^ 

GTGTCCTCGGGCTCATATGTGGGAAGGCAGAGGATCTCTGAGGAGTTCCCTGGGGACAACTGAGC 

AGCCTCTGGAGAGGGGCCATTAATAAAGCTCAACATCATTGA 



FIGURE 232 

></usr/seqdb2/sst/DNA/Dnaseqs.full/ss.DNA68886 

xsubunit 1 of 1, 520 aa, 1 stop 
xMW: 52658, pi: 9.16, NX(S/T): 3 

Transmembrane domain: 

amino acids 47-66 (type II) 

N-glycosylation sites. 

amino acids 43-47, 83-87, 136-140 

Tyrosine kinase phosphorylation site- 

amino acids 432-440 

N-myristoylation sites, 

amino acids 41-47, 178-184, 253-259, 274-280, 340-346, 346-352, 
400-406, 441-447, 475-481, 490-496, 515-521 



Amidation site. 

amino acids 360-364 

Leucine zipper pattern. 

amino acids 56-78 

Speract receptor repeat 

amino acids 422-471, 488-519 

Clq domain proteins. 

amino acids 151-184, 301-334, 316-349 



FIGURE 233 



CCCACGCGTCCGAAGGCAGACAAAC3GTTCATTTGTAAAGAAGCTCCTTCCAGC^^^^ 
CTCCTTTTGCCCAAACTCACCCAGTGAGTGTGAGCATTTAAGAAGCATCCTCTGCC^ 

AGGAAAGAAGAAAAAGGGCCAAAAGCCAAAATGAAACTGAT^^^ 

AACTTTGCTGCTAGGAGTTCAAGCCATGCCTGCAAATCGCCTCTCTTGCTACAG^^ 

AAGATCACAACTCTCACAACCTTCCGGAAGGAGTAGCTGACCTGAC^CAGATTGATGTCA^^^ 

CAGGATCATTTCTGGGATGGGAAGGGATGTGAGATGATCTGTTACTGCAACTTCAGCG^^^ 

CTGCTGCCCAAAAGACGTTTTCTTTGGACCAAAGATCTCTTTCGTGATTCC™^ 

GAGAATCTTCATGTATTCTGGAGAACACCAITCCTGATTTCCCACAAACTG^^^ 

IICTGCATTTCTAGTTTCTATATAGTGCAATAGAGCATAGATTCTATAAATTCTTACTTGTC^^ 

PACAAGTAAATCTGTGTTAAACAAGTAGTAATAAAAGTTAATTCAATCTAAAAAAAAAA^ 



FIGURE 234 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52758 

<s\abunit 1 of 1, 98 aa, 1 stop 
<MW: 11081, pi: 6.68, NX(S/T): 1 

MKLMVLVFTIGLTLLLGVQAMPi^RLSCYRKILKDHNCHNLPEGVA^^^ 
EMICYCNFSELLCCPKDVFFGPKISFVIPCNNQ 

Important features : 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-76 

Tyrosine kinase phosphorylation site. 

amino acids 63-71 



FIGURE 235 

g?SS?^S?SI?5^?cSgtggaaagaatttggcctggattctgt^ 

SS?S^??^tcSACCCAAATAAGACTCATCCCAACTACATCTCAATAAT^^ 

S^?^a?gaSttttcScacat^ 

IcS?S^SAASSAivCTTCTTTAAATTGGAACGGGACATG^ 

SSt^tgccSmStgggaaagttttcagaggaaataagg^^ 

S??^SaS?5?™tSactccgaccctc^ 
?S?SSS?tggaScttcctggaggtggtgtccagcgtggaaatatc 

SSSSct^?^?SSSacccagcaaatgaatatgcttataggc^^^ 
?^™5otc?^tS?cctgttcatccaattg^ 

IJSSOTScS^GCASAcSkTAGCAGCT^^^ 

S^cSSS5?S?gS?Sttctacacaaaaagtcaa^^^^^ 

S^SSSSAiTTASATGTGATAGGTACTCT 

??otS^S^caccgggactcatgggtgtttggtggtatt^^^ 

ot5?^^?5g?S^agSttggaacactgaaaaaggaagggtgga^ 

???^?^gSSStgcagaagaatttggtcttcttggttctactgagtg^ 

StcAagactccttcaagagcgtggcgtggct^^^ 

tISot?tgISSttgaSgtacaccgctgatgtacagctt^^^ 

g^gc??tStgaaggctttgaaggcaaatctctttatgaaag^^ 

pS^^^SS^SgtggStSccaggataagcaaattgggatct^ 

S^^^SS^SS^ttSggcagagc^cggtatactaaaaattggga^ 

^Ir^SA^JcACTGTATCACAGTGTCTATGAAA^^^ 

S^5ot5t^5a?cSc??aSgtc^^ 

??SSS?I???CTATGSACATC(:y^CAGGAAATGAA^^ 

5?^SSSISgIatttSSgaaattccttccaagttcagt^^^^ 

in^G?S???^T^TASAAGAATGATGAATGATCAACTCATGTTTCT^ 

?£^?S^?AGGGTTScAkcAGGCCTTTTTAT^^ 
SGSTCC^^^GT^TTCCCAGGAATTTATGATGCTCTGrn^^ 

Sa????t^?^^agaagtgaagagacagatttatgttgcagccttcacagtgc^ 

agSgSg^?tttgagtgaagtagcctaagaggat^ 

^^^atc^^ctJaSLgaatcgtaatgggtatattgat^^ 

ataaagttgaatattatatataa 



FIGURE 236 



></usr/seqdb2/sst/DNA/Dnaseqs . f ull/ss . DNA52756 
xsubunit 1 of 1, 750 aa, 1 stop 
xMW: 84305, pi: 6.93, NX(S/T): 10 

MWNLmETDSAVATARRPRWLCAGALVLAGGFPLLGFLFGWFIKSSNEATNITPKHNMKAFLDEL 

KAENIKKFLHNFTQIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLLSYPNKTHPNYISI 

INEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEGDLVYVNYARTEDFFKLERDMKIN 

CSGKIVIARYGKVFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNLPGGGVQRGNILN 

LNGAGDPLTPGYPANEYAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSLKV 

PYNVGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQS 

GAAWHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEENSRLLQERGVAYINADSS 

lEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSGMPRISKLGSGND 

FEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYETYELVEKFYDPMFKYHLTVAQVRGGMVF 

ELANSIVLPFDCRDYAWLRKYADKIYSISMKHPQEMKTYSVSFDSLFSAVKNFTEIASKFSERL 

QDFDKSNPIVLRMMNDQLMFLERAFIDPLGLPDRPFYRHVIYAPSSHNKYAGESFPGIYDALFDI 

ESKVDPSKAWGEVKRQIYVAAFTVQAAAETLSEVA 

Signal sequence: 

amino acids 1-40 

N-glycosylation sites. 

amino acids 76-80, ' 121-125, 140-144, 153-157, 195-199, 336-340, 
459-463, 476-480, 638-642 

Tjyrosine kinase phospliorylation sites. 

amino acids 363-372, 605-613, 606-613, 617-626 

M-myristoylation sites. 

amino acids 85-91, 168-174, 252-258. 256^262, 282-288, 335-341, 
360-366, 427-433, 529-535, 707-713 

Wi\DOCS\BGY\BGV-2104.DOC 
030702 



